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(57) Abstract 

An isolated and purified DNA molecule encoding hyaluro- 
nan synthase-2 (Has2) is provided, as is purified and isolated 
Has2 polypeptide. Also provided is an isolated and purified DNA 
molecule encoding hyaluronan synthase-3 (Has3), as is purified 
and isolated Has3 polypeptide. 
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GENE ENCODING HYALURONAN SYNTHASE 

5 

Background of the Invention 

Hyaluronan (HA, hyaluronic acid) is a linear unbranched polymer made 
up of repeating disaccharide units of D-glucuronic acid (pi^ 3) N- 
acetylglucosamine (pi-4). HA biosynthesis requires two enzyme activities; the 

1 0 transfer of UDP-N-acetylglucosamine (UDP-GlcN Ac) and UDP-glucuronic acid 
(UDP-GlcUA), respectively, to the growing HA chain. HA is synthesized at the 
inner face of the plasma membrane and is subsequently extruded to the outside 
of the cell. HA is a major constituent of the extracellular matrix during 
embryonic development. For example, within the developing embryo, HA 

15 accumulates at sites of cell migration and proliferation, and has been proposed to 
play important roles in craniofacial, limb, neural tube, and heart development. In 
particular, HA is essential for the formation of endocardial cushions, the 
structures required for septation and the development of heart valves. In adults, 
HA is a major constituent of the extracellular matrix of most tissues and organs, 

20 and a critical component of the vitreous humor of the eye, joint fluid and 
cartilage. 

HA is highly biocompatible and completely biodegradable, and has 
demonstrated beneficial effects when administered to the joints of arthritic race 
horses and to perforated rat tympanic membranes. HA has also been employed 

25 to protect eye tissue during artificial intraocular lens implantations, as a delivery 
agent for drugs and to prevent post-operative scarring. 

Genes which encode HA biosynthetic enzymes have been identified in 
bacteria, e.g., Group A Streptococcus (Wessels et al., Infect, lmmun.. 62 . 433 
(1994); DeAngelis et al., J. Biol. Chem. . 268. 19181 (1993); DeAngelis et al., 

30 Biochemistry . 33 . 9033 (1994)). Polymerization of HA by S. pyogenes occurs 
through the action of a single enzyme, HA synthase, encoded by the hasA gene. 
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The S. pyogenes HA synthase is localized to the membrane and is predicted to 
have several transmembrane domains and a large intracellular loop 
encompassing the active site of the enzyme. Purified immobilized HasA has 
been shown to be sufficient for HA polymerization in vitro (DeAngelis et al., 
5 Biochemistry , 33 , 9033 (1994)). The transfer of the hasA gene and a second 
gene, hasB, into heterologous bacterial species results in the synthesis of an HA 
capsule (DeAngelis et al., J. Biol. Chem. . 268 . 19181 (1993)). The hasB gene 
encodes a UDP-glucose dehydrogenase, which converts UDP-glucose to UDP- 
glucuronic acid (UDP-GlcUA), a subunit of HA. 

10 However, there is evidence that other genes are also involved in bacterial 

HA biosynthesis. A protein originally identified in Streptococcus equisimilis as 
HA synthase (Lansing et al., Biochem. J. . 2S2> 179 (1993)) has no sequence 
similarity to S. pyogenes HasA but has significant sequence similarity to 
bacterial proteins involved in oligopeptide binding and transport. Although the 

15 total amount of HA synthesized by bacterial cells overexpressing the S. 

equisimilis HA synthase increased, the length of the resultant HA chains was 
significantly shorter, suggesting that the increase may be a function of an 
elevation in the rate of HA transport from the cell (O'Regan et al., Int. J. Biol. 
Macromol. . 16 . 283 (1994)). Thus, rather than being directly involved in HA 

20 biosynthesis, the S. equisimilis HA synthase may be involved in the transport of 
HA, or may participate in HA synthesis as an accessory molecule, rather than as 
the synthase itself. 

While both bacterial and animal sources of HA exist, high molecular 
weight HA is difficult and costly to isolate and purify due to the fact that HA is 

25 complexed with proteoglycans. Moreover, both bacterial and animal sources of 
HA are increasingly under more stringent regulatory controls due to fear of 
contamination with identifiable, or as yet unidentified, infectious or toxic agents. 
Furthermore, the extensive purification process of HA polymer from cells results 
in an HA polymer of considerable molecular weight polydispersity. 
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Thus, there is a need to isolate and purify genes that encode eukaryotic 
HA biosynthetic enzymes or proteins associated with the extracellular 
accumulation of HA. 

Summary of the Invention 

5 The present invention provides an isolated and purified DNA molecule 

comprising a preselected DNA segment encoding eukaryotic, preferably 
mammalian, hyaluronan synthase-2 (Has2), a biologically active variant thereof 
or a biologically active subunit thereof. A preferred embodiment of the 
invention is a DNA molecule comprising a preselected DNA segment, e.g., SEQ 

1 0 ID NO: 1 , that encodes murine hyaluronan synthase-2. A murine hyaluronan 
synthase-2 having SEQ ID NO:2 has 21% identity and 28% similarity to 
Streptococcal HasA, and 55% identity and 73% similarity to murine Hasl (Itano 
et al., J. Biol. Chem. . 271 . 9875 (1996); SEQ ID NO:3). Because the deduced 
amino acid sequence of Hasl is distinct from the murine hyaluronan synthase-2 

1 5 having SEQ ID NO:2, there appears to be more than one mammalian gene 

encoding an enzyme or protein which is associated with HA biosynthesis and/or 
extracellular HA accumulation. Another preferred embodiment of the invention 
is a DNA molecule comprising a preselected DNA segment, e.g., SEQ ID 
NO:23, that encodes human hyaluronan synthase-2, a polypeptide which does 

20 not have amino acid sequence identity with the human homolog of murine Has 1 
(Itano et al., BBRC . 222. 816 (1996); SEQ ID NO:55). Also provided is an 
isolated and purified DNA molecule comprising a preselected DNA segment 
which encodes a protein that increases the amount of extracellular hyaluronan 
produced by cultured primate cells transformed so as to express said DNA 

25 segment. 

Further provided is an isolated and purified DNA molecule comprising a 
DNA segment encoding eukaryotic, preferably mammalian, hyaluronan 
synthase-3 (Has3), or a biologically active variant thereof or a biologically active 
subunit thereof. A preferred embodiment of the invention includes a preselected 
30 DNA segment comprising SEQ ID NO:3 1 which encodes a hyaluronan synthase- 
3 comprising SEQ ID NO:32. Another preferred embodiment of the invention 
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includes a DNA molecule comprising a preselected DNA segment comprising 
SEQ ID NQ:25 which encodes a hyaluronan synthase-3 comprising SEQ ID 
NO:29. The DNA molecules of the invention are double-stranded or single- 
stranded, preferably, they are cDNA. 
5 An isolated and purified DNA molecule, such as a probe or a primer, e.g., 

an oligonucleotide, of at least seven, preferably at least fifteen, nucleotide bases 
which hybridizes under stringent conditions to the DNA molecules of the 
invention, or RNA molecules derived from these DNA molecules, is also 
provided by the invention. The term "stringent conditions" is defined 

1 0 hereinbelow. The probes or primers of the invention have at least about 80%, 
preferably at least about 90%, identity to the above-disclosed DNA sequences, or 
sequences complementary thereto. A preferred embodiment of the invention 
includes a probe or primer which has at least about 80%, preferably at least about 
90%, more preferably at least about 95%, identity to 1) SEQ ID NO:l, 2) SEQ 

1 5 ID NO:23, 3) SEQ ID NO:25, 4) SEQ ID NO:26 or 5) SEQ ID NO:3 1 , or a 

sequence complementary thereto. The probes or primers of the invention may be 
detectably labeled or have a binding site for a detectable label. The probes or 
primers are useful to detect, quantify and/or amplify DNA strands with 
complementary to sequences related to hyaluronan synthase-2 or hyaluronan 

20 synthase-3 in eukaryotic tissue samples. The probes and primers of the present 
invention are also useful for detecting RNA molecules resulting from 
transcription of the DNA molecules of the present invention. The uses of probes 
and primers, as well as their isolation, purification and conditions under which 
they are employed for the detection or amplification of a specific gene, are well 

25 known in the art. 

The present invention also provides isolated and purified DNA molecules 
which provide "anti -sense" mRNA transcripts of the DNA sequences, including 
SEQ ID NO: 1 or SEQ ID NO:3 1, which, when expressed from an expression 
cassette in a host cell, can alter HA expression. 

30 The present invention also provides an expression cassette comprising a 

promoter which is functional in a host cell operably linked to a preselected DNA 
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segment encoding hyaluronan synthase-2. Preferably, the expression cassette 
comprises a preselected DNA segment encoding murine hyaluronan synthase-2. 
Another preferred embodiment of the invention is an expression cassette 
comprising a preselected DNA segment encoding human hyaluronan synthase-2. 
5 The present invention further provides an expression cassette comprising 

a promoter which is functional in a host cell operably linked to a preselected 
DNA segment encoding hyaluronan synthase-3. Preferably, the expression 
cassette comprises a preselected DNA segment encoding murine hyaluronan 
synthase-3. Another preferred embodiment of the invention is an expression 

10 cassette comprising a preselected DNA segment encoding human hyaluronan 
synthase-3. Such expression cassettes can be placed into expression vectors 
which can then be employed to transform prokaryotic or eukaryotic host cells. It 
is envisioned that the vectors of the invention may be useful to transform 
mammalian cells in vivo, or in vitro with subsequent introduction of the 

1 5 transformed cells to a host organism. The in vivo delivery of the vectors may be 
accomplished by methods well known to the art, including, but not limited to, 
viral- or Iiposome-mediated delivery. The present cassettes can also contain a 
functional DNA sequence which is a selectable marker gene or reporter gene, as 
described below. 

20 Also provided is a transformed host cell, the genome of which has been 

augmented by a preselected DNA sequence encoding hyaluronan synthase-2, a 
preselected DNA sequence encoding hyaluronan synthase-3, or a combination 
thereof. Preferably, the preselected DNA sequence is integrated into the 
chromosome of the transformed host cell, and is heritable. 

25 Expression of mouse hyaluronan synthase-2 or mouse hyaluronan 

synthase-3 in COS-1 cultured primate cells results in the formation of large well- 
pronounced HA coats, as described hereinbelow. Moreover, HA coat formation 
in COS cells transfected with an hyaluronan synthase-2 expression vector 
occurred in the absence of HA receptor expression, exogenously added HA, or 

30 proteoglycans. This suggests that hyaluronan synthase-2 expression leads to the 
synthesis of HA, in a form which is extruded through the plasma membrane and 
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may associate with the cell surface to form an HA coat through continued 
attachment to the HA synthase. 

Further provided is isolated, purified hyaluronan synthase-2 polypeptide. 
A preferred embodiment of the invention is isolated, purified murine hyaluronan 
5 synthase-2 polypeptide. Another preferred embodiment of the invention is 
isolated, purified hyaluronan synthase-2 polypeptide having SEQ ID NO:2. 

Also provided is isolated, purified hyaluronan synthase-3 polypeptide. A 
preferred embodiment of the invention is isolated, purified murine hyaluronan 
synthase-3 polypeptide. Another preferred embodiment of the invention is 

10 isolated, purified hyaluronan synthase-3 polypeptide having SEQ ID NO:32. 

As used herein, the term "Has2" or "hyaluronan synthase-2" is preferably 
defined to mean a polypeptide comprising SEQ ID NO:2, as well as variants of 
SEQ ID NO:2 which have at least about 80%, preferably at least about 90%, 
identity or homology to SEQ ID NO:2, or a biologically active subunit thereof. 

15 Biologically active subunits of hyaluronan synthase-2, variant hyaluronan 

synthase-2 polypeptides and biologically active subunits thereof, falling within 
the scope of the invention have at least about 50%, preferably at least about 80%, 
and more preferably at least about 90%, the activity of the hyaluronan synthase-2 
polypeptide comprising SEQ ID NO:2. The activity of an hyaluronan synthase-2 

20 polypeptide can be measured by methods well known to the art including, but 
not limited to, the particle exclusion assay described hereinbelow, an 
immunoassay which detects HA production, as described by Itano et al. ( J. Biol. 
Chem.. 271. 9875 (1996)), HA synthase activity of crude membrane 
preparations, as described by Itano et al. (supra), or HA synthase activity of cell 

25 lysate preparations, as described by Meyer et al. f Proc. Natl. Acad. Sci. USA . 93 . 
4543 (1996)). 

As used herein, the term "Has3" or "hyaluronan synthase-3" is preferably 
defined to mean a polypeptide comprising SEQ ID NO:32, SEQ ID NO:29, or a 
biologically active subunit thereof, as well as variants of SEQ ID NO:32 or SEQ 
30 ID NO:29 and subunits thereof which have at least about 80%, preferably at least 
about 90%, identity or homology to SEQ ID NO:32 or SEQ ID NO:29, 
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respectively. Biologically active subunits of hyaluronan synthase-3, variant 
hyaluronan synthase-3 polypeptides and biologically active subunits thereof, 
falling within the scope of the invention have at least about 50%, preferably at 
least about 80%, and more preferably at least about 90%, the activity of the 
5 hyaluronan synthase-3 polypeptide comprising SEQ ID NO:32 or SEQ ID 

NO:29. The activity of an hyaluronan synthase-3 polypeptide can be measured 
by the methods described above for hyaluronan synthase-2. 

The present invention also provides a method to produce hyaluronan 
synthase-2, comprising: culturing a host cell, preferably a primate host cell, 

10 transformed with a nucleic acid molecule comprising a DNA segment encoding 
hyaluronan synthase-2 operably linked to a promoter, so that said host cell 
expresses said hyaluronan synthase-2. The method also preferably provides 
isolated recombinant hyaluronan synthase-2 polypeptide which is recovered 
from the transformed host cells. 

1 5 Also provided is a method to produce hyaluronan synthase-3, 

comprising: culturing a host cell transformed with a nucleic acid molecule 
comprising a DNA segment encoding hyaluronan synthase-3 operably linked to 
a promoter, so that said host cell expresses said hyaluronan synthase-3. The 
method also preferably provides isolated recombinant hyaluronan synthase-3 

20 polypeptide which is recovered from the transformed host cells. Optionally, host 
cells can be co-transformed with a nucleic acid molecule comprising a DNA 
segment encoding hyaluronan synthase-3 operably linked to a promoter and a 
nucleic acid molecule comprising a DNA segment encoding hyaluronan 
synthase-2 operably linked to a promoter. 

25 Further provided is a method of altering the amount of hyaluronan 

produced by a cell. The method comprises introducing into a host cell a 
preselected DNA segment encoding hyaluronan synthase-2 operably linked to a 
promoter so as to yield a transformed host cell. The preselected DNA segment is 
expressed as hyaluronan synthase-2 in the transformed host cell in an amount 

30 that results in the transformed host cell producing an altered, preferably 
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increased, amount of hyaluronan relative to the amount of hyaluronan produced 
by a corresponding untransformed host cell. 

Also provided is a method of altering the amount of hyaluronan produced 
by a cell. The method comprises introducing into a host cell a preselected DNA 
5 segment encoding hyaluronan synthase-3 operably linked to a promoter so as to 
yield a transformed host cell. The preselected DNA segment is expressed as 
hyaluronan synthase-3 in the transformed host cell in an amount that results in 
the transformed host cell producing an altered, preferably increased, amount of 
hyaluronan relative to the amount of hyaluronan produced by a corresponding 

1 0 untransformed host cell. 

Once isolated and purified, the genes involved in HA biosynthesis and 
extracellular accumulation of HA can be employed to synthesize HA in vitro. 
Because in vitro synthesized HA is of extremely high purity, is free from 
bacterial and animal cell contaminants, and can be optimized as to its 

15 physicochemical properties, it is a preferred source of HA relative to HA derived 
from bacterial or animal sources. Thus, the invention provides a method to 
prepare HA which comprises contacting an amount of hyaluronan synthase-2, an 
amount of hyaluronan synthase-3, or a combination thereof, with a mixture of 
components under conditions effective to yield hyaluronan. 

20 Moreover, the identification of genes involved in HA biosynthesis and/or 

coat formation may also be useful for defining the molecular basis for genetic 
diseases which are associated with a deficiency in HA biosynthesis, such as 
cartilage pathologies, for providing a clinically useful diagnostic test or in 
molecular-based therapeutics. Furthermore, the cloning of these genes will help 

25 to elucidate the molecular mechanism giving rise to the alteration of the protein 
encoded by the gene in patients having a particular disorder, e.g., a cartilage 
deficiency associated with reduced HA biosynthesis. 

Thus, the invention provides a method to prevent or treat a condition 
associated with an alteration in HA synthesis or extracellular accumulation. The 

30 method comprises administering to a mammal afflicted with, or at risk of, said 
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condition an amount of mammalian hyaluronan synthase-2 effective to alter HA 

synthesis or extracellular accumulation. 

The invention also provides a method to prevent or treat a condition 

associated with an alteration in HA synthesis or extracellular accumulation, 
5 comprising: administering to a mammal afflicted with, or at risk of, said 

condition an amount of mammalian hyaluronan synthase-3 effective to alter HA 

synthesis or extracellular accumulation. 

Also provided is a method to identify a mammal afflicted with, or at risk 

of, a condition associated with aberrant HA synthesis or extracellular 
10 accumulation. The method comprises contacting an agent that binds to 

mammalian hyaluronan synthase-2 with a mammalian sample suspected of 

containing hyaluronan synthase-2 so as to form a complex. Then the presence or 

amount of complex formation is detected or determined and the presence or 

amount of complex formation is correlated with the presence or absence of the 
15 condition. 

The invention also provides a method to identify a mammal afflicted 
with, or at risk of, a condition associated with aberrant HA synthesis or 
extracellular accumulation which employs an agent that binds to mammalian 
hyaluronan synthase-3. The agent is contacted with a mammalian sample 

20 suspected of containing hyaluronan synthase-3 so as to form a complex. The 
presence or amount of complex formation is detected or determined and the 
presence or amount of complex formation is correlated with the presence or 
absence of the condition. 

Further provided is a method for detecting hyaluronan synthase-2 DNA. 

25 The method comprises contacting an amount of DNA obtained by reverse 

transcription of RN A from a mammalian physiological sample which comprises 
cells suspected of containing hyaluronan synthase-2 RNA, with an amount of at 
least two oligonucleotides under conditions effective to amplify the DNA by a 
polymerase chain reaction so as to yield an amount of amplified hyaluronan 

30 synthase-2 DNA. At least one oligonucleotide is an hyaluronan synthase-2- 
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specific oligonucleotide. The presence or amount of the amplified hyaluronan 
synthase-2 DNA is then detected. 

The invention also provides a method for detecting hyaluronan synthase- 
3 DNA. The method comprises contacting an amount of DNA obtained by 
5 reverse transcription of RNA from a mammalian physiological sample which 
comprises ceils suspected of containing hyaluronan synthase-3 RNA, with an 
amount of at least two oligonucleotides under conditions effective to amplify the 
DNA by a polymerase chain reaction so as to yield an amount of amplified 
hyaluronan synthase-3 DNA. At least one oligonucleotide is an hyaluronan 

1 0 synthase-3-specific oligonucleotide. The presence or amount of the amplified 
hyaluronan synthase-3 DNA is detected. 

As used herein, the term "hyaluronan synthase-2 -specific 
oligonucleotide" or "hyaluronan synthase-3-specific oligonucleotide" means a 
DNA sequence that has at least about 80%, preferably at least about 90%, and 

1 5 more preferably at least about 95%, sequence identity with SEQ ID NO: 1 or 
SEQ ID NO:23 (has2), or SEQ ID NO:25, SEQ ID NO:29 or SEQ ID NO:32 
(has3), respectively. An oligonucleotide or primer of the invention has at least 
about 7-50, preferably about 10-40, and more preferably about 15-35, 
nucleotides. Preferably, the oligonucleotide primers of the invention comprise at 

20 least 7 nucleotides at their 3' end which have at least about 85% identity to SEQ 
ID NO: 1, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:29 or SEQ ID NO:32. 
The oligonucleotides of the invention may also include sequences which are 
unrelated to has sequences. 

Further provided is a method for detecting a condition associated with 

25 aberrant HA synthesis or extracellular accumulation. The method comprises 
contacting an amount of DNA obtained by reverse transcription of RNA from a 
mammalian physiological sample which comprises cells suspected of containing 
hyaluronan synthase-2 RNA, with an amount of at least two oligonucleotides 
under conditions effective to amplify the DNA by a polymerase chain reaction so 

30 as to yield an amount of amplified hyaluronan synthase-2 DNA. Alternatively, 
or concurrently, an amount of DNA obtained by reverse transcription of RNA 
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from a mammalian physiological sample which comprises cells suspected of 
containing hyaluronan synthase-3 RNA is contacted with an amount of at least 
two oligonucleotides under conditions effective to amplify the DNA by a 
polymerase chain reaction so as to yield an amount of amplified hyaluronan 
5 synthase-3 DNA. Then the presence or amount of the amplified hyaluronan 
synthase-2 and/or hyaluronan synthase-3 DNA is detected. The presence or 
amount of hyaluronan synthase-2 DNA is indicative of the presence of the 
condition in said mammal and/or hyaluronan synthase-3. 

The invention also provides a therapeutic method in which an amount of 

10 an agent that alters the activity of native hyaluronan synthase-2, native 

hyaluronan synthase-3, or a combination thereof, is administered to a mammal. 

Brief Descrip tion of the Figures 
Figure 1. Degenerate RT-PCR analysis. An agarose gel is shown which 
depicts polymerase chain reaction (PCR) amplified bands characteristic of a 

15 typical RT-PCR experiment. RT-PCR was performed on total RNA isolated 
from 10.5 days post coitum (dpc) (E 10.5) and 14.5 dpc (E 14.5) C57BL/6J 
mouse embryos. M, indicates 1 kilobase pair ladder (GIBCO-BRL/Life 
Technologies, Gaithersburg, MD). DEG1/3 indicates degenerate primer pools 1 
and 3. DEG 1/5 indicates degenerate primer pools 1 and 5. 

20 Figure 2. cDNA library clones. The extent of overlapping cDNA clones 

is shown in relation to the mouse Has2 cDNA and to the degenerate RT-PCR 
mouse Has2 cDNA clone, MHas300. The positions of the translation initiation 
codon (ATG), the translation termination codon (TGA), and the internal EcoRI 
restriction endonuclease site (E) are indicated. 

25 Figure 3. Nucleotide sequence encoding, and corresponding amino acid 

sequence of, mouse Has2 (SEQ ID NO:l and SEQ ID NO:2, respectively). The 
5' and 3* untranslated nucleotide sequences are shown in lowercase, whereas the 
open reading frame is shown in uppercase. The stop codon, consensus 
polyadenylation signals, CA repeat and TA repeat are underlined. 

30 Figure 4. Alignment of mouse Has2 with mouse Hasl (Itano et al., L 

Biol. Chem. . 271 . 9875 (1 996)) (SEQ ID NO:3), Xenopus laevis DG42 (SEQ ID 
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NO:4), Streptococcus pyogenes HasA (SEQ ID N0:5), and Rhizobium meliloti 
NodC (SEQ ID NO:6). Identical residues are boxed. Dashes indicate gaps that 
have been introduced to maximize the identity. Asterisks below the line indicate 
positions at which there have been conservative amino acid substitutions, 
5 Figure 5. Alignment of two regions of mouse Has2 (SEQ ID NOs:7 and 

8) with equivalent regions of mouse Hasl (Itano et al., supra) (SEQ ID NO:9 
and SEQ ID NO:40), X laevis DG42 (SEQ ID NO: 10 and SEQ ID NO:42), 5 
pyogenes HasA (SEQ ID NO.l 1 and SEQ ID NO:44), R. meliloti NodC (SEQ ID 
NO: 12 and SEQ ID NO:46) and S. cerevisiae chitin synthase 2 (Chs2) (SEQ ID 
1 0 NO: 1 3 and SEQ ID NO:45). Dashes represent gaps that have been introduced to 
maximize homology. Residues highlighted in bold type are those that have been 
demonstrated to be critical in terms of enzyme activity of Chs2 (see Nagahashi et 
aL, J. Biol. Chem.. 270 , 1 3961 (1995)) and that are conserved in all six 
sequences. 

1 5 Figure 6. Kyte-Doolittle hydrophilicity plots and linear cartoon 

representation of mouse Has2 protein. A) Comparison of mouse Has2, mouse 
Hasl and Streptococcus pyogenes HasA by Kyte-Doolittle hydrophilicity plots. 
The amino acid sequences of mouse Has2, mouse HAS (Hasl ) and bacterial 
HasA were analyzed using the Kyte-Doolittle algorithm (MacVector) with a 

20 hydrophilicity window size of 15. Strongly hydrophobic areas of the proteins 
are indicated below the axes. Areas predicted to be potential transmembrane 
domains or signal peptide are indicated by the black bars below each plot. B) 
Linear representation of mouse Has2 predicted protein. Hydrophobic areas are 
indicated by the filled black boxes. Consensus B(X 7 )B HA binding motifs 

25 (HABM) are indicated by the filled gray boxes and are numbered. These motifs 
correspond to amino acid residues 100-108, 107-1 15, 420-428, and 460-468. 
The predicted intracellular loop of the molecule is indicated. 

Figure 7. Northern analyses of mouse Has2 expression. Multiple tissue 
Northern blots of polyA + RNA isolated from mouse embryos and adult tissues 

30 were hybridized with a mouse Has2 ORF cDNA probe. The relative positions of 
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RNA molecular weight markers are indicated at the left of each blot. A GAPDH 
probe was employed as an internal control. 

Figure 8. Southern analysis of mouse Has2. Total 129Sv/J mouse 
genomic DN A was digested with the restriction enzymes, E (EcoRJ), B 
5 (BamHI), H (Hindlll), and S (SacI) and probed with a labeled mouse Has2 ORF 
cDNA. U M" indicates 1 kilobase pair ladder. 

Figure 9. COS-1 cells expressing mouse Has2 hyaluronan coats. HA 
coats were detected by a particle exclusion assay (see Clarris et aL, Exp. Cell 
Res. . 49, 181 (1986)). (A) Mouse 3T6 embryonic fibroblasts. (B) COS-1 cells. 
10 (C) COS-1 cells co-transfected with a P-gal expression vector and pCIneo 

control vector. (D-I) COS-1 cells co-transfected with a vector which expresses 
mouse Has2 and a vector which expresses p-gal. (E) Co-transfected COS-1 cells 
which were maintained in starvation-medium. (F and I) Co-transfected COS-1 
cells stained for P-gal activity. (H) Co-transfected COS-1 cells which were 
15 maintained in starvation-medium containing hyaluronidase. 

Figure 10. (A) Partial nucleotide sequence of human hyaluronan 
synthase-2 (SEQ ID NO:23). (B) Nucleotide sequence alignment of human 
hyaluronan synthase-2 (SEQ ID NO:23) and mouse hyaluronan synthase-2 (SEQ 
ID NO: 1). (C) Amino acid sequence alignment of human hyaluronan synthase-2 
20 (SEQ ID NO:24) and mouse hyaluronan synthase-2 (SEQ ID NO:2). 

Figure 1 1. (A) Partial nucleotide sequence of human hyaluronan 
synthase-3 (SEQ ID NO:25). (B) Partial nucleotide sequence of murine 
hyaluronan synthase-3 (SEQ ID NO;26). (C) Nucleotide sequence alignment of 
human hyaluronan synthase-3 (SEQ ID NO:25) and mouse hyaluronan synthase- 
25 3 (SEQ ID NO:26). (D) Amino acid sequence alignment of human hyaluronan 
synthase-3 (SEQ ID NO:27) and mouse hyaluronan synthase-3 (SEQ ID NO:28). 

Figure 12. (A) Amino acid sequence alignment of a partial sequence for 
human hyaluronan synthase-3 (Has3) (SEQ ID NO:29) with the equivalent 
sequence of mouse Has3 (SEQ ID NO:30). Conserved amino acids are 
30 indicated by a dash (~). (B) Nucleotide (SEQ ID NO:3 1 ) and predicted amino 
acid (SEQ ID NO:32) sequence of the Has3 open reading frame. Sequences 
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representing consensus HA binding motifs are underlined. The location of three 
introns within the gene are indicated by arrowheads. The first intron is located 
immediately preceding the start codon (ATG). 

Figure 13. Northern blot depicting the expression of mouse Has3 at four 
5 different stages of mouse embryonic development. A cDN A probe representing 
the mouse Has3 ORF was radiolabeled and hybridized to a blot containing 
mouse embryonic polyA+ RNAs (CLONTECH) under conditions recommended 
by the manufacturer. 

Figure 14. (A) Amino acid sequence alignment of mouse Has3 (SEQ ID 

10 NO:32) with mouse Has2 (Mhas2) (SEQ ID NO:2), mouse Hasl (Mhasl ) (SEQ 
ID NO:3), Xenopus laevis DG42 (DG42) (SEQ ID NO:4) and Streptococcus 
pyogenes HasA (SEQ ID NO:5). Conserved residues are boxed. Gaps have 
been introduced to maximize the alignment. Asterisks indicate positions at 
which there have been significant conservative amino acid substitutions. (B) 

15 Alignment of two regions of the mouse Has3 protein sequence (SEQ ID NO:35 
and SEQ ID NO:36, respectively) with equivalent regions of related 
glycosyltransferases including mouse Has2 (SEQ ID NO: 7 and SEQ ID NO:2, 
respectively), mouse Hasl (SEQ ID NO:9 and SEQ ID NO:40, respectively), 
Xenopus DG42 (SEQ ID NO: 1 0 and SEQ ID NO:42, respectively), S. pyogenes 

20 HasA (SEQ ID NO: 1 1 and SEQ ID NO:44, respectively), Rhizobium meliloti 
NodC (SEQ ID NO: 12 and SEQ ID NO:46, respectively), Gossypium hirsutum 
putative cellulose synthase Al (celAl) (SEQ ID NO:47 and SEQ ID NO:48 ? 
respectively) and Saccharomyces cerevisiae Chitin synthase 2 (Chs2) (SEQ ID 
NO: 15 and SEQ ID NO:45, respectively). Site-directed mutagenesis of the 

25 residues highlighted in bold of yeast Chs2 resulted in loss of enzymatic activity 
(Nagahashi et al., J. Biol. Chem. . 2Z£L 13961 (1995)), suggesting that these 
residues may be critical for [Jl-4 glycosyltransferase activity. (C) Kyte-Doolittle 
hydrophilicity plots of mouse Has3, mouse Has2, mouse Hasl and S. pyogenes 
HasA. Hydrophobic areas are represented below the axes. Potential 

30 transmembrane domains are indicated by black bars drawn below each plot. 
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Figure 15. COS-1 cells expressing mouse Has 3 hyaluronan coats. HA 
coats were detected as described in the legend to Figure 9. (A) COS-1 cells co- 
transfected with a P-gal expression vector and a vector which expresses mouse 
Has2. (B) COS-1 cells co-transfected with a P-gal expression vector and pCIneo 
5 control vector. (C) COS-1 cells co-transfected with a vector which expresses 
mouse Has3 and a vector which expresses P-gal before mock treatment with 
hyaluronidase. (D) COS-1 cells co-transfected with a vector which expresses 
mouse Has3 and a vector which expresses P-gal after mock treatment with 
hyaluronidase. (E) COS-1 ceils co-transfected with a vector which expresses 
10 mouse Has3 and a vector which expresses p-gal before treatment with 

hyaluronidase. (F) COS-1 cells co-transfected with a vector which expresses 
mouse Has3 and a vector which expresses P-gal after treatment with 
hyaluronidase. 

Detailed Description of the Invention 

15 Definitions 

"Southern analysis" or "Southern blotting" is a method by which the 
presence of DNA sequences in a restriction endonuclease digest of DNA or 
DNA-containing composition is confirmed by hybridization to a known, labeled 
oligonucleotide or DNA fragment. Southern analysis typically involves 

20 electrophoretic separation of DNA digests on agarose gels, denaturation of the 
DNA after electrophoretic separation, and transfer of the DNA to nitrocellulose, 
nylon, or another suitable membrane support for analysis with a radiolabeled, 
biotinylated, or enzyme-labeled probe as described in sections 9.37-9.52 of 
Sambrook et al., Molecular Cloning: A Laboratory Manual . Cold Spring Harbor 

25 (1989). 

"Northern analysis" or "Northern blotting" is a method used to identify 
RN A sequences that hybridize to a known probe such as an oligonucleotide, 
DNA fragment, cDNA or fragment thereof, or RNA fragment. The probe is 
labeled with a radioisotope such as 32 P, by biotinylation or with an enzyme. The 
30 RNA to be analyzed can be usually electrophoretically separated on an agarose 
or poly aery lamide gel, transferred to nitrocellulose, nylon, or other suitable 
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membrane, and hybridized with the probe, using standard techniques well known 
in the art such as those described in sections 7.39-7.52 of Sambrook et al., supra. 

"Polymerase chain reaction" or "PCR" refers to a procedure or technique 
in which amounts of a preselected fragment of nucleic acid, RNA and/or DNA, 
5 are amplified as described in U.S. Patent No. 4,683,195. Generally, sequence 
information from the ends of the region of interest or beyond is employed to 
design oligonucleotide primers. These primers will be identical or similar in 
sequence to opposite strands of the template to be amplified. PCR can be used to 
amplify specific RNA sequences, specific DNA sequences from total genomic 
10 DNA, and cDNA transcribed from total cellular RNA, bacteriophage or plasmid 
sequences, and the like. See generally Mullis et aL, Cold Spring Harbor Symp. 
Quant. Biol. . £L 263 (1987); Erlich, ed., PCR Technology . (Stockton Press, NY, 
1989). 

As used herein "stringent conditions" means conditions that detect a 

1 5 nucleic acid molecule with at least 80%, preferably at least 90%, nucleotide 
sequence homology to the probe or primer sequence. See Sambrook et al., 
Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press 
(2nd ed., 1989) for selection of hybridization and washing conditions for 
DNA:DNA, as well as DNA:RNA (Northern blot), stable and specific duplex 

20 formation. Stringent conditions are those that (1 ) employ low ionic strength and 
high temperature for washing, for example, 0.015 M NaCl/0.0015 M sodium 
citrate (SSC); 0.1% sodium lauryl sulfate (SDS) at 50°C, or (2) employ a 
denaturing agent such as formamide during hybridization, e.g., 50% formamide 
with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 

25 mM sodium phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM sodium 
citrate at 42°C. Another example is use of 50% formamide, 5 x SSC (0.75 M 
NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% 
sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA 
(50 ng/ml), 0.1% sodium dodecylsulfate (SDS), and 10% dextran sulfate at 

30 42°C, with washes at 42°C in 0.2 x SSC and 0.1% SDS. 
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Sources of Nucleic Acids Encoding Has2 or Has3 

A mouse gene has been recently identified that encodes a putative HA 
synthase, Hasl (Itano et al., J. Biol. Chem. . 22L 9875 (1996)). However, the 
results of a complementation analysis conducted by Itano et al. during the 
5 isolation of the Hasl gene indicated that in the mouse, there are at least three 
genes that are involved in HA biosynthesis. Sources of nucleotide sequences 
from which these other genes, i.e., the present DNA molecules encoding Has2 or 
Has3, can be derived include total or polyA + RNA from eukaryotic, preferably 
mammalian, embryonic cells, or mesothelioma and Wilms' tumors or cell lines 
10 derived therefrom, as well as RNA isolated from embryonic tissue samples of 
cartilage, heart, neural tube and the like. Other sources of the DNA molecules of 
the invention include genomic DNA or cDNA libraries derived from any 
eukaryotic source including other mammals, e.g., rat, bovine, equine and the 
like, and other primates, e.g., humans and monkeys. 

15 

Isolation of a Gene Encodin g Has2 or Has3 

A nucleic acid molecule encoding mammalian HA biosynthetic enzymes, 
such as Has2 or Has3, can be identified and isolated using standard methods, as 
described by Sambrook et al., Molecular Cloning: A Laboratory Manual . Cold 

20 Spring Harbor, NY ( 1 989). For example, degenerate reverse-transcriptase PCR 
(RT-PCR) can be employed to isolate and clone Has2 or Has3 genes. This 
approach relies upon conserved sequences deduced from alignments of related 
gene or protein sequences. Sequence analysis of the has A gene of S. pyogenes 
predicts that the HA synthase is a membrane protein with a large intracellular 

25 loop encoding the active site of the enzyme (DeAngelis et al., J. Biol. Chem. . 
268 . supra). Similarly, in mammalian cells, the HA synthase has been localized 
to the plasma membrane, with the active site on the inner face of the membrane 
(Philipson et al., J. Biol. Chem» 252, 5017 (1984); Prehm, Biochem. J. . 22&, 597 
(1984)). Moreover, database searches have identified the Rhizobium sp. 

30 nodulation factor C (NodC) proteins, the Saccharomyces cerevisiae chitin 
synthase 2 (Chs2) proteins, and the Xenopus laevis DG42 protein as sharing 
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sequence identity with Has A (DeAngelis, et aL, Biochem. Biophvs. Res. 
Comn M mi (1994)). 

At least two degenerate primer pools for RT-PCR are prepared, one of 
which is predicted to anneal to the antisense strand, and one of which is 
5 predicted to anneal to the sense strand of a putative eukaryotic DN A molecule 
which encodes HA synthase. The oligonucleotides are made to correspond to 
highly conserved regions of the proteins which were compared to generate the 
primers. 

One degenerate primer pool is then utilized for the first-strand synthesis. 

10 RNA is isolated, e.g., using TRIZOL™ reagent (GIBCO-BRL/Life Technologies, 
Gaithersburg, MD). Reverse transcription reactions are performed on a source of 
nucleic acid believed to contain the DNA or RNA sequences of interest, e.g., 
total RNA isolated from mouse embryos. 

Resultant first-strand cDN As are then amplified in separate PCR 

15 reactions. The products of each PCR reaction are separated via an agarose gel 
and all consistently amplified products are gel-purified and cloned directly into a 
suitable vector, such as a piasmid vector. The resultant plasmids are subjected to 
restriction endonuclease and dideoxy sequencing of double-stranded piasmid 
DNAs. 

20 Another approach to identify, isolate and clone genes which encode 

mammalian HA biosynthetic enzymes is to screen a cDNA library generated 
from embryonic heart or cartilage tissue. Screening for DNA fragments that 
encode all or a portion of the gene encoding Has2 or Has3 can be accomplished 
by probing the library with a probe, which has sequences that are highly 

25 conserved between genes believed to be related to Has2 or Has3, e.g., Hasl, 

Has A, DG42 or NodC, or by screening of plaques for binding to antibodies that 
specifically recognize Has2 or Has3 related proteins. DNA fragments that bind 
to a probe having sequences which are related to Has2 or Has3, or which are 
immunoreactive with antibodies to Has2 or Has3 related proteins, can be 

30 subcloned into a suitable vector and sequenced and/or used as probes to identify 
other cDNA or genomic sequences encoding all or a portion of Has2 or Has3. 
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As used herein, the terms "isolated and/or purified" refer to in vitro 
isolation of a DNA or polypeptide molecule from its natural cellular 
environment, and from association with other components of the cell, such as 
nucleic acid or protein, so that it can be sequenced, replicated, and/or expressed. 
5 For example, "isolated Has2 nucleic acid" is RNA or DNA containing greater 
than 7, preferably 15, and more preferably 20 or more, sequential nucleotide 
bases that encode a biologically active Has2 polypeptide or a fragment thereof, 
or a biologically active variant Has2 polypeptide or a fragment thereof, that is 
complementary to the non-coding strand, or complementary to the coding strand, 

10 of the native Has2 polypeptide RNA or DNA, or hybridizes to said RNA or 
DNA and remains stably bound under stringent conditions. 

"Isolated Has3 nucleic acid" is RNA or DNA containing greater than 7, 
preferably 15, and more preferably 20 or more, sequential nucleotide bases that 
encode a biologically active Has3 polypeptide or a fragment thereof, or a 

1 5 biologically active variant Has3 polypeptide or a fragment thereof, that is 

complementary to the non-coding strand, or complementary to the coding strand, 
of the native Has3 polypeptide RNA or DNA, or hybridizes to said RNA or 
DNA and remains stably bound under stringent conditions. Thus, the RNA or 
DNA is "isolated" in that it is free from at least one contaminating nucleic acid 

20 with which it is normally associated in the natural source of the RNA or DNA 
and is preferably substantially free of any other mammalian RNA or DNA. The 
phrase "free from at least one contaminating source nucleic acid with which it is 
normally associated" includes the case where the nucleic acid is reintroduced 
into the source or natural cell but is in a different chromosomal location or is 

25 otherwise flanked by nucleic acid sequences not normally found in the source 
cell. An example of isolated Has2 nucleic acid is RNA or DNA that encodes a 
biologically active Has2 polypeptide sharing at least about 80%, preferably at 
least about 90%, sequence identity with the Has2 polypeptide of Figure 3. An 
example of isolated Has3 nucleic acid is RNA or DNA that encodes a 

30 biologically active Has3 polypeptide sharing at least about 80%, preferably at 
least about 90%, sequence identity with the Has3 polypeptide of Figure 12B. 
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As used herein, the term "recombinant nucleic acid" or "preselected 
nucleic acid," e.g., "recombinant DNA sequence or segment" or "preselected 
DNA sequence or segment" refers to a nucleic acid, i.e., to DNA that has been 
derived or isolated from any appropriate tissue source, that may be subsequently 
5 chemically altered in vitro^ so that its sequence is not naturally occurring, or 
corresponds to naturally occurring sequences that are not positioned as they 
would be positioned in a genome which has not been transformed with 
exogenous DNA. An example of preselected DNA "derived" from a source, 
would be a DNA sequence that is identified as a useful fragment within a given 

10 organism, and which is then chemically synthesized in essentially pure form. An 
example of such DNA "isolated" from a source would be a useful DNA sequence 
that is excised or removed from said source by chemical means, e.g., by the use 
of restriction endonucleases, so that it can be further manipulated, e.g., 
amplified, for use in the invention, by the methodology of genetic engineering. 

15 Thus, recovery or isolation of a given fragment of DNA from a restriction 

digest can employ separation of the digest on polyacrylamide or agarose gel by 
electrophoresis, identification of the fragment of interest by comparison of its 
mobility versus that of marker DNA fragments of known molecular weight, 
removal of the gel section containing the desired fragment, and separation of the 

20 gel from DNA. See Lawn et al., Nucleic Acids Res. . 2, 6103 (1981), and 
Goeddel et al., Nucleic Acids Res. . & 4057 (1980). Therefore, "preselected 
DNA" includes completely synthetic DNA sequences, semi-synthetic DNA 
sequences, DNA sequences isolated from biological sources, and DNA 
sequences derived from RNA, as well as mixtures thereof. 

25 As used herein, the term "derived" with respect to a RNA molecule 

means that the RNA molecule has complementary sequence identity to a 
particular DNA molecule. 

Variants of the DNA Molecules of the Invention 
30 Nucleic acid molecules encoding amino acid sequence variants of Has2 

or Has3 are prepared by a variety of methods known in the art. These methods 
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include, but are not limited to, isolation from a natural source (in the case of 
naturally occurring amino acid sequence variants) or preparation by 
oligonucleotide-mediated (or site-directed) mutagenesis, PCR mutagenesis, and 
cassette mutagenesis of a DNA molecule encoding an earlier prepared variant or 
5 a non-variant version of Has2 or Has3 polypeptide. 

Oligonucleotide-mediated mutagenesis is a preferred method for 
preparing amino acid substitution variants of Has2 or Has3. This technique is 
well known in the art as described by Adelman et al., DNA . 2, 183 (1983). 
Briefly, Has2 or Has3 DNA is altered by hybridizing an oligonucleotide 

10 encoding the desired mutation to a DNA template, where the template is the 
single-stranded form of a plasmid or bacteriophage containing the unaltered or 
native DNA sequence of Has2 or Has3. After hybridization, a DNA polymerase 
is used to synthesize an entire second complementary strand of the template that 
will thus incorporate the oligonucleotide primer, and will code for the selected 

1 5 alteration in the Has2 or Has3 DNA. 

Generally, oligonucleotides of at least 25 nucleotides in length are used. 
An optimal oligonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the nucleotide(s) coding for the 
mutation. This ensures that the oligonucleotide will hybridize properly to the 

20 single-stranded DNA template molecule. The oligonucleotides are readily 

synthesized using techniques known in the art such as that described by Crea et 
al., Proc. Natl. Acad. Sci. U.S.A. . Z£, 5765 (1978). 

The DNA template can be generated by those vectors that are either 
derived from bacteriophage Ml 3 vectors (the commercially available M13mpl 8 

25 and Ml 3mpl9 vectors are suitable), or those vectors that contain a single- 
stranded phage origin of replication as described by Viera et al., Meth. Enzvmol. . 
153 . 3 (1987). Thus, the DNA that is to be mutated may be inserted into one of 
these vectors to generate single-stranded template. Production of the single- 
stranded template is described in Sections 4.21-4.41 of Sambrook et al., 

30 Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory 
Press, N.Y. 1989). 
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Alternatively, single-stranded DNA template may be generated by 
denaturing double-stranded plasmid (or other) DNA using standard techniques. 

For alteration of the native DNA sequence (to generate amino acid 
sequence variants, for example), the oligonucleotide is hybridized to the single- 
5 stranded template under suitable hybridization conditions. A DNA polymerizing 
enzyme, usually the Klenow fragment of DNA polymerase I, is then added to 
synthesize the complementary strand of the template using the oligonucleotide as 
a primer for synthesis. A heteroduplex molecule is thus formed such that one 
strand of DNA encodes the mutated form of the Has2 or Has3, and the other 

10 strand (the original template) encodes the native, unaltered sequence of the Has2 
or Has3, respectively. This heteroduplex molecule is then transformed into a 
suitable host cell, usually a prokaryote such as E. coli JM1 01 . After the cells are 
grown, they are plated onto agarose plates and screened using the 
oligonucleotide primer radiolabeled with 32-phosphate to identify the bacterial 

15 colonies that contain the mutated DNA. The mutated region is then removed and 
placed in an appropriate vector for protein production, generally an expression 
vector of the type typically employed for transformation of an appropriate host. 

The method described immediately above may be modified such that a 
homoduplex molecule is created wherein both strands of the plasmid contain the 

20 mutations(s). The modifications are as follows: The single-stranded 

oligonucleotide is annealed to the single-stranded template as described above. 
A mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP), 
deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with 
a modified thiodeoxyribocytosine called dCTP-(aS) (which can be obtained from 

25 the Amersham Corporation). This mixture is added to the template- 

oligonucleotide complex. Upon addition of DNA polymerase to this mixture, a 
strand of DNA identical to the template except for the mutated bases is 
generated. In addition, this new strand of DNA will contain dCTP-(aS) instead 
of dCTP, which serves to protect it from restriction endonuclease digestion. 

30 After the template strand of the double-stranded heteroduplex is nicked 

with an appropriate restriction enzyme, the template strand can be digested with 
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ExoIII nuclease or another appropriate nuclease past the region that contains the 
site(s) to be mutagenized. The reaction is then stopped to leave a molecule that 
is only partially single-stranded. A complete double-stranded DNA homoduplex 
is then formed using DNA polymerase in the presence of all four 
5 deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex 
molecule can then be transformed into a suitable host cell such as E. coli JM1 01 . 

A preferred embodiment of the invention is an isolated and purified DNA 
molecule comprising a preselected DNA segment encoding an Has2 polypeptide 
having SEQ ID NO:2, wherein the DNA segment comprises SEQ ID NO: 1 , or 

10 variants of SEQ ID NO:l having nucleotide substitutions which are "silent." 
That is, when nucleotide substitutions are present in a codon, the same amino 
acid is encoded by the codon with the nucleotide substitution as is encoded by 
the codon without the substitution. For example, leucine is encoded by the 
codon CTT, CTC, CTA and CTG. A variant of SEQ ID NO: 1 at the seventh 

15 codon (CTA in SEQ ID NO:l) includes the substitution of CTI, CTC or CTQ 
for CTA- Other "silent" nucleotide substitutions in SEQ ID NO: 1 which can 
encode a polypeptide having SEQ ID NO:2 can be ascertained by reference to 
page Dl in Appendix D in Sambrook et al., Molecular Cloning: A Laboratory 
Manual (1989). Nucleotide substitutions can be introduced into DNA segments 

20 by methods well known to the art. See, for example, Sambrook et al., supra. 

Another preferred embodiment of the invention is an isolated and 
purified DNA molecule comprising a preselected DNA segment encoding an 
Has3 polypeptide having SEQ ID NO:32, wherein the DNA segment comprises 
SEQ ID NO:31, or variants of SEQ ID NO:31 having nucleotide substitutions 

25 which are "silent." That is, when nucleotide substitutions are present in a codon, 
the same amino acid is encoded by the codon with the nucleotide substitution as 
is encoded by the codon without the substitution. For example, leucine is 
encoded by the codon CTT, CTC, CTA and CTG. A variant of SEQ ID NO:3 1 
at the fifth codon (CTQ. in SEQ ID NO:31) includes the substitution of CTT, 

30 CTC or CTA for CT£. Other "silent" nucleotide substitutions in SEQ ID NO:3 1 
which can encode a polypeptide having SEQ ID NO:32 can be ascertained by 
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reference to page Dl in Appendix D in Sambrook et al., Molecular Cloning: A 
Laboratory Manual (1989). Nucleotide substitutions can be introduced into 
DNA segments by methods well known to the art. See, for example, Sambrook 
et al., supra. 

5 

Chimeric Expression Cassettes 

As used herein, "chimeric" means that a vector comprises DNA from at 
least two different species, or comprises DNA from the same species, which is 
linked or associated in a manner which does not occur in the "native" or wild 

10 type of the species. 

The recombinant or preselected DNA sequence or segment, used for 
transformation herein, may be circular or linear, double-stranded or single- 
stranded. Generally, the preselected DNA sequence or segment is in the form of 
chimeric DNA, such as plasmid DNA, that can also contain coding regions 

1 5 flanked by control sequences which promote the expression of the preselected 
DNA present in the resultant cell line. Aside from preselected DNA sequences 
that serve as transcription units for Has2, Has3, or portions thereof, a portion of 
the preselected DNA may be untranscribed, serving a regulatory or a structural 
function. For example, the preselected DNA may itself comprise a promoter that 

20 is active in mammalian cells, or may utilize a promoter already present in the 
genome that is the transformation target. Such promoters include the CMV 
promoter, as well as the SV40 late promoter and retroviral LTRs (long terminal 
repeat elements), although many other promoter elements well known to the art 
may be employed in the practice of the invention. A preferred promoter useful 

25 in the practice of the invention is the CMV promoter. 

Other elements functional in the host cells, such as introns, enhancers, 
polyadenylation sequences and the like, may also be a part of the preselected 
DNA. Such elements may or may not be necessary for the function of the DNA, 
but may provide improved expression of the DNA by affecting transcription, 

30 stability of the mRNA, or the like. Such elements may be included in the DNA 
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as desired to obtain the optimal performance of the transforming DNA in the 
cell. 

"Control sequences" is defined to mean DNA sequences necessary for the 
expression of an operably linked coding sequence in a particular host organism. 
5 The control sequences that are suitable for prokaryotic cells, for example, 

include a promoter, and optionally an operator sequence, and a ribosome binding 
site. Eukaryotic cells are known to utilize promoters, polyadenylation signals, 
and enhancers. 

"Operably linked" is defined to mean that the nucleic acids are placed in 

1 0 a functional relationship with another nucleic acid sequence. For example, DNA 
for a presequence or secretory leader is operably linked to DNA for a 
polypeptide if it is expressed as a preprotein that participates in the secretion of 
the polypeptide; a promoter or enhancer is operably linked to a coding sequence 
if it affects the transcription of the sequence; or a ribosome binding site is 

1 5 operably linked to a coding sequence if it is positioned so as to facilitate 

translation. Generally, "operably linked" means that the DNA sequences being 
linked are contiguous and, in the case of a secretory leader, contiguous and in 
reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not 

20 exist, the synthetic oligonucleotide adaptors or linkers are used in accord with 
conventional practice. 

The preselected DNA to be introduced into the cells further will generally 
contain either a selectable marker gene or a reporter gene or both to facilitate 
identification and selection of transformed cells from the population of cells 

25 sought to be transformed. Alternatively, the selectable marker may be carried on 
a separate piece of DNA and used in a co-transformation procedure. Both 
selectable markers and reporter genes may be flanked with appropriate 
regulatory sequences to enable expression in the host cells. Useful selectable 
markers are well known in the art and include, for example, antibiotic and 

30 herbicide-resistance genes, such as neo, hpt t dhfr, bar, aroA, dapA and the like. 
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See also, the genes listed on Table 1 of Lundquist et al. (U.S. Patent No. 
5,848,956). 

Reporter genes are used for identifying potentially transformed cells and 
for evaluating the functionality of regulatory sequences. Reporter genes which 
5 encode for easily assayable proteins are well known in the art. In general, a 
reporter gene is a gene which is not present in or expressed by the recipient 
organism or tissue and which encodes a protein whose expression is manifested 
by some easily detectable property, e.g., enzymatic activity. Preferred genes 
include the chloramphenicol acetyl transferase gene (cat) from Tn9 of E. coti, the 

10 beta- glucuronidase gene (gus) of the uidA locus of E. co//, and the luciferase 

gene from firefly Photinus pyralis. Expression of the reporter gene is assayed at 
a suitable time after the DNA has been introduced into the recipient cells. 

The general methods for constructing recombinant DNA which can 
transform target cells are well known to those skilled in the art, and the same 

1 5 compositions and methods of construction may be utilized to produce the DNA 
useful herein. For example, J. Sambrook et al., Molecular Cloning: A 
Laboratory Manual , Cold Spring Harbor Laboratory Press (2d ed., 1989), 
provides suitable methods of construction. 

20 Transformation into Host Cells 

The recombinant DNA can be readily introduced into the host cells by 
transfection with an expression vector comprising DNA encoding Has2, or an 
expression vector comprising DNA encoding Has3, by any procedure useful for 
the introduction into a particular cell, e.g., calcium phosphate precipitation, 

25 lipofection, electroporation, and the like. 

As used herein, the term M cell line" or "host cell" is intended to refer to 
well-characterized homogenous, biologically pure populations of cells. These 
cells may be eukaryotic cells that are neoplastic or which have been 
"immortalized" in vitro by methods known in the art, as well as primary cells, or 

30 prokaryotic cells. The cell line or host cell is preferably of mammalian origin, 
but cell lines or host cells of non-mammalian origin may be employed, including 
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plant, insect, yeast, fungal or bacterial sources. Generally, the preselected DNA 
sequence is resident in the genome of the host cell but is not expressed, or not 
highly expressed. 

"Transfected" or "transformed" is used herein to include any host cell or 
5 cell line, the genome of which has been altered or augmented by the presence of 
at least one preselected DNA sequence, which DNA is also referred to in the art 
of genetic engineering as "heterologous DNA," "recombinant DNA," 
"exogenous DNA," "genetically engineered," "non-native," or "foreign DNA," 
wherein said DNA was isolated and introduced into the genome of the host cell 

10 or cell line by the process of genetic engineering. The host cells of the present 
invention are typically produced by transfection with a DNA sequence in a 
plasmid expression vector, a viral expression vector, or as an isolated linear 
DNA sequence. Preferably, the transfected DNA is a chromosomally integrated 
recombinant DNA sequence, which comprises a gene encoding Has2, or which 

15 comprises a gene encoding Has3, which host cell may or may not express 
significant levels of autologous or "native" hyaluronan. 

Has2 or Has 3 Polypeptides 

The present invention provides an isolated, purified Has2, or an isolated, 

20 purified Has3, which can be prepared by recombinant DNA methodologies. The 
general methods for isolating and purifying a recombinantly expressed protein 
from a host cell are well known to those in the art. Examples of the isolation and 
purification of such proteins are given in Sambrook et al., cited supra. 
Moreover, since the present invention provides the complete amino acid 

25 sequence of murine Has2 (Figure 3), and murine Has3 (Figure 12B), they or 
bioactive variants thereof can also be synthesized by the solid phase peptide 
synthetic method. This established and widely used method, including the 
experimental procedures, is described in the following references: Stewart et al., 
Solid Phase Peptide Synthesis . W. H. Freeman Co., San Francisco (1969); 

30 Merrifield, J. Am. Chem. Soc . 2149 (1963); Meienhofer in "Hormonal 

Proteins and Peptides," ed.; C.H. Li, Vol. 2 (Academic Press, 1973), pp. 48-267; 
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and Bavaay and Merrifield, "The Peptides," eds. E. Gross and F. Meienhofer, 
Vol. 2 (Academic Press, 1980) pp. 3-285. 

When Has2 or Has3 polypeptide is expressed in a recombinant cell, 
preferably a Has2- or Has3- cell, respectively, it is necessary to purify Has2 or 
5 Has3 polypeptide from recombinant cell proteins or polypeptides to obtain 
preparations that are substantially homogenous as to Has2 or Has3 polypeptide. 
For example, the culture medium or lysate can be centrifuged to remove 
particulate cell debris. The membrane and soluble protein fractions are then 
separated. The Has3 polypeptide may then be purified from the soluble protein 

10 fraction and, if necessary, from the membrane fraction of the culture lysate. 
Has3 polypeptide can then be purified from contaminant soluble or membrane 
proteins and polypeptides by fractionation on immunoaffinity or ion-exchange 
columns; ethanol precipitation; reverse phase HPLC; chromatography on silica 
or on an anion-exchange resin such as DEAE; chromatofocusing; SDS-P AGE; 

15 ammonium sulfate precipitation; gel filtration using, for example, Sephadex G- 
75; or ligand affinity chromatography. 

Has2 polypeptide, Has3 polypeptide, variant Has2 polypeptides, variant 
Has3 polypeptides, or biologically active subunits thereof can also be prepared 
by in vitro transcription and translation reactions. For example, a Has3 

20 expression cassette can be employed to generate Has3 transcripts which are 
subsequently translated in vitro so as to result in a preparation of substantially 
homogenous Has3, variant Has3, or biologically active subunits thereof The 
construction of vectors for use in vitro transcription/translation reactions, as well 
as the methodologies for such reactions, are well known to the art. 

25 Once isolated from the resulting transgenic host cells or from in vitro 

transcription/translation reactions, derivatives and chemically derived variants of 
the Has2 polypeptide or Has 3 polypeptide can be readily prepared. For 
example, amides of the Has3 polypeptides of the present invention may also be 
prepared by techniques well known in the art for converting a carboxylic acid 

30 group or precursor, to an amide. A preferred method for amide formation at the 
C-terminal carboxyl group is to cleave the polypeptide from a solid support with 
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an appropriate amine, or to cleave in the presence of an alcohol, yielding an 
ester, followed by aminolysis with the desired amine. 

Salts of carboxyl groups of the Has2 polypeptide or Has3 polypeptide 
may be prepared in the usual manner by contacting the peptide with one or more 
5 equivalents of a desired base such as, for example, a metallic hydroxide base, 
e.g., sodium hydroxide; a metal carbonate or bicarbonate base such as, for 
example, sodium carbonate or sodium bicarbonate; or an amine base such as, for 
example, triethylamine, triethanolamine, and the like. 

N-acyl derivatives of an amino group of the present polypeptides may be 

10 prepared by utilizing an N-acyl protected amino acid for the final condensation, 
or by acylating a protected or unprotected peptide. O-acyl derivatives may be 
prepared, for example, by acylation of a free hydroxy peptide or peptide resin. 
Either acylation may be carried out using standard acylating reagents such as 
acyl halides, anhydrides, acyl imidazoles, and the like. Both N- and O-acylation 

1 5 may be carried out together, if desired. In addition, the internal Has2 or Has3 

amino acid sequence of Figure 3 or Figure 12B, respectively, can be modified by 
substituting one or two conservative amino acid substitutions for the positions 
specified, including substitutions which utilize the D rather than L form. The 
invention is also directed to variant or modified forms of the Has2 polypeptide or 

20 Has 3 polypeptide. One or more of the residues of the Has 2 polypeptide can be 
altered, so long as the variant polypeptide has at least about 50% of the 
biological activity of the protein having SEQ ID NO:2. One or more of the 
residues of the Has 3 polypeptide can be altered, so long as the variant 
polypeptide has at least about 50% of the biological activity of the protein 

25 having SEQ ID NO:32. Conservative amino acid substitutions are preferred—that 
is, for example, aspartic-glutamic as acidic amino acids; lysine/arginine/histidine 
as basic amino acids; leucine/isoleucine, methionine/valine, alanine/valine as 
hydrophobic amino acids; serine/glycine/alanine/threonine as hydrophilic amino 
acids. 

30 Acid addition salts of the polypeptides may be prepared by contacting the 

polypeptide with one or more equivalents of the desired inorganic or organic 
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acid, such as, for example, hydrochloric acid. Esters of carboxyl groups of the 
polypeptides may also be prepared by any of the usual methods known in the art. 

Has2 or Has 3 Variant Polypeptide? 

5 It is envisioned that variant Has2 polypeptides have at least one amino 

acid substitution relative to SEQ ID NO:2. It is also envisioned that variant Has3 
polypeptides have at least one amino acid substitution relative to SEQ ID NO:32. 
In particular, amino acids are substituted in a relatively conservative manner. 
Such conservative substitutions are shown in Table 1 under the heading of 
1 0 exemplary substitutions. More preferred substitutions are under the heading of 
preferred substitutions. After the substitutions are introduced, the products are 
screened for biological activity. 
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TABLE 1 



Original 


Exemplary 


Preferred 


Residue 


Substitutions 


Substitutions 


Ala (A) 


val; leu; ile 


val 


Arg (R) 


lys; gin; asn 


lys 


Asn (N) 


gin; his; lys; arg 


gin 


Asp (D) 


glu 


glu 


Cys (C) 


ser 


ser 


Gin (Q) 


asn 


asn 


Glu (E) 


asp 


asp 


Gly (G) 


pro 


pro 


His (H) 


asn; gin; lys; arg 


arg 


He (I) 


leu; val; met; ala; phe 


leu 




norleucine 




Leu (L) 


norleucine; ile; val; met; 


ile 




ala; phe 




Lys (K) 


arg; gin; asn 


arg 


Met (M) 


leu; phe; ile 


leu 


Phe (F) 


leu; val; ile; ala 


leu 


Pro (P) 


gly 


gly 


Ser (S) 


thr 


thr 


Thr (T) 


ser 


ser 


Trp (W) 


tyr 


tyr 


Tyr (Y) 


trp; phe; thr; ser 


phe 


Val (V) 


ile; leu; met; phe; ala; 


leu 




norleucine 





Amino acid substitutions falling within the scope of the invention, are, in 
general, accomplished by selecting substitutions that do not differ significantly 
in their effect on maintaining (a) the structure of the polypeptide backbone in the 
area of the substitution, for example, as a sheet or helical conformation, (b) the 
30 charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the 
side chain. Naturally occurring residues are divided into groups based on 
common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 
35 (3) acidic: asp, glu; 
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(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic; tip, tyr, phe. 

The invention also envisions Has2 or Has3 variants with non- 
5 conservative substitutions. Non-conservative substitutions entail exchanging a 
member of one of the classes described above for another. Amino acid 
substitutions are introduced into the DNA molecules of the invention by methods 
well known to the art. For example, see the description hereinabove for the 
introduction of silent mutations into the DNA molecules of the invention. 

10 

Uses of Has2 or Has3 Genes and Polypeptide s Thereof 

The genes involved in HA biosynthesis and extracellular accumulation of 
HA ("HA coat formation") can be employed to synthesize HA in vitro. Because 
in vitro synthesized HA is of extremely high purity, is free from bacterial and 
15 animal cell contaminants, and can be optimized as to its physicochemical 

properties, it is preferred to HA derived by extraction from bacterial or animal 
sources. 

In vitro prepared HA has a similar range of applications as those 
described above for HA which is derived from animal or bacterial cells, e.g., 

20 protecting eye tissue during artificial intraocular lens implantation, as a drug 

delivery vehicle, and preventing or inhibiting post-operative adhesions. In vitro 
synthesized HA may also be employed to enhance or promote wound healing or 
tissue repair, e.g., to prevent restenosis following balloon angioplasty, and to 
repair or replace damaged or absent cartilage present in congenital defects, 

25 craniofacial disorders and arthritis. In addition, HA can be derivatized, as 

described in Balazs et al. (Blood Coag. Fibrinolysis . 2, 173 (1991)), to provide 
improved mechanical properties and an extended residence time in vivo. 

Moreover, the identification of genes involved in HA biosynthesis and/or 
coat formation may also be useful for defining the molecular basis for genetic 

30 diseases, such as cartilage pathologies, e.g, rheumatoid arthritis, and for 

providing a clinically useful diagnostic test or in molecular-based therapeutics. 
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Once such a gene has been identified, a probe specific for the gene can be made. 
Patient DNA can be screened with the probe to detect particular genetic variants 
that correlate with disease, e.g., craniofacial disorders. Patient RNA can be 
incubated with the probe to determine if the gene is over or under expressed in a 
5 patient with a particular disease relative to disease-free patients. 

Furthermore, the cloning of genes involved in HA biosynthesis and/or 
extracellular coat formation will help to elucidate the molecular mechanism 
giving rise to the alteration of the protein encoded by the gene, or its expression, 
in patients having a particular disorder, e.g., cartilage deficiency. Once the 

10 molecular mechanism underlying the expression of the gene is understood, 

molecular genetic-based therapies directed to controlling the expression of the 
gene can then be employed to correct or supplement the expression of the gene 
in patients with the disorder. 

For example, accelerated HA degradation accompanies osteoarthritis and 

1 5 inflammatory arthritides. Thus, the administration of Has2 and/or Has3 
polypeptide, expression vectors encoding Has2 and/or Has3 polypeptide or 
agents that increase the expression or activity of native (i.e., endogenous) Has2 
and/or Has3 may be efficacious for diseases which are characterized by 
decreased levels of HA. Hyperthyroidism (Graves Disease) is associated with 

20 excessive accumulation of HA in retro-orbital connective tissues, in the pretibial 
area and elsewhere. In addition, various ill-characterized skin disorders or 
mucinosis are also associated with accumulation of HA in the dermis. Thus, the 
administration of agents that inhibit the expression or activity of native Has2 
and/or Has3 or expression vectors comprising has2 and/or has3 antisense 

25 sequences, may be useful to prevent or treat these disorders. 

In addition, high serum levels of HA are associated rheumatoid arthritis, 
septic conditions accompanying certain malignancies, e.g., mesothelioma and 
Wilms' tumor, and edema due to inflammation in the lung and in kidneys post- 
kidney transplantation. HA has also been implicated in Grave's ophthalmopathy, 

30 cirrhosis of the liver and accelerated aging in Werner's syndrome. Thus, the 
isolation of eukaryotic HA biosynthetic genes can be useful in gene therapies 
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which employ the cloned genes in ant i sense expression vectors to inhibit or 
reduce the overexpression of HA genes in these patient populations. For 
example, an expression vector containing antisense Has3 can be introduced into 
joints (for rheumatoid arthritis), or into mesothelioma or Wilms' tumor cells, to 
5 inhibit or reduce the overexpression of Has3. 

Identification of Agents that Alter Has2 and/or Has3 Expression or Activity 

Agents that increase or decrease native Has2 or Has3 activity or 
expression may be identified using in vitro assays. For example, cells with low 

10 basal Has2 or Has3 activity, such as Chinese Hamster Ovary (CHO) cells, are 
stably transfected with recombinant plasmids that express Has2 and/or Has3. 
The resulting cell lines are then contacted with an agent and the amount of HA 
synthesized or secreted, and the amount of HA coat formation, in the presence of 
the agent relative to cells not exposed to the agent, is determined, using methods 

15 described herein. To assess coat formation, a bead binding assay may be 
employed. In this assay, polypeptide fragments with HA binding activity 
(so-called HA binding domain or HABR) are covalently attached to micro-beads 
tagged by fluorescent or other means (e.g., biotinylation). Agents that enhance 
HA coat formation may be useful to decrease the adhesive properties of tissue, 

20 e.g., mesothelial, surfaces. 

Screening for agents that regulate Has2 and/or Has3 activity may also be 
accomplished using an assay described in Spicer et al., ( J. Biol. Chem. . 272 . 
8957 (1997)). Radiolabeled UDP-sugar substrates (either UDP-N-acetyl-D- 
glucosamine or UDP-D-glucuronate) in the presence of the other required 

25 substrates are incubated with membrane extracts (10-25 mg protein) in the 

presence or absence of the agent for 2 hours at 37°C. The radiolabeled precursor 
molecules are then separated from the high molecular weight HA product by 
paper chromatography and agarose gel electrophoresis. Paper chromatography 
allows accurate quantification of enzyme activity, while agarose gel 

30 electrophoresis allows rapid assessment of molecular mass. Filter assays using 
precipitation with cetylpyridinium chloride or HPLC isolation of reductive 
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products of HA degradation by Streptomyces hyaluronidase may also be 
employed. Direct interaction of an agent with Has2 and/or Has3 may be 
determined by binding assays utilizing purified, recombinant Has2 and/or Has3 
polypeptide present in liposomes or detergent micelles and labeled agent. 
5 Agents that interact with highly conserved sequences present in enzymes 

involved in synthesis of P 1-4 linkages may be useful to inhibit native Has2 
and/or Has3. S. cerevisiae chitin synthase 2 (Chs2) has two highly conserved 
domains present in all chitin synthases that are critical to enzymatic activity and 
speculated to be generally conserved in glycosyltransferases that catalyze the 

10 synthesis of oligosaccharides with (J 1 -4 linkages (Nagahasi et al., J. Biol 
Chem. . 270 . 13961 (1995)). Sequence alignments of Has 1, mHas2, DG42, 
HasA, NodC, and Chs2 revealed that several amino acid residues required for 
catalytic activity of Chs2 are conserved in mHas2 and mHas3. In particular, the 
second region of homology in Chs2 contains the highly conserved motif 

15 NMYLA-EDRIL residues (556-565; SEQ ID NO:56). Mutations at residue 562 
in Chs2 resulted in complete loss of enzymatic activity. The similarity of mHas2 
in this region (NQCSFGDDRH; SEQ ID NO:57) suggests that mutation of the 
highly conserved D at position 314 may result in loss of enzymatic activity. 
Expression of a mutant mHas2, having an amino acid substitution (D- A) at this 

20 position, in COS-1 cells did not result in coat formation. Similarly, agents that 
are ligand mimetics, e.g., 5-azido-UDP-glucuronic acid, may be tested for their 
ability to alter Has2 and/or Has3 activity. Thus, agents that interact with 
domains which comprise residues required for catalytic activity may be useful in 
vivo inhibitors of Has2 and/or Has3 activity. 

25 

Methods to Administer has2 or has3 Genes or Polypeptides to Tissue Surfaces 

Delivery of has2 and/or has3 genes (e.g., in viral vectors or liposomes) or 
purified Has2 and Has3 polypeptide (e.g., in liposomes) to tissue, e.g., 
mesothelial, surfaces provides an alternative approach to exogenous instillation 
30 of HA containing solutions or HA containing films to coat opposing surfaces 
with HA, to decrease adhesivity. To determine whether has genes or purified 
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Has polypeptide are useful to alter mesothelial HA synthesis or accumulation, 
cultured mesothelial cells are transfected with has2 and/or has3 expression 
vectors and/or contacted with purified Has2 and/or Has3 polypeptide. Sections 
of serosa stripped off of the underlying mesothelial tissue may also be employed. 
5 These sections are maintained in suspended well culture (e.g., Becton- Dickson 
Transwells) which allows access of nutrients to epithelial sheets. Radiolabeled 
precursors (e.g., 3 H or M C labeled N-acetyl-D-glucosamine) can be added to the 
culture medium of cultured cells or serosa, and secretion of HA analyzed by 
removing the culture medium, and determining the incorporation of radiolabeled 

10 precursor into a high molecular weight form (e.g., > 1 x 10 6 Daltons) which is 
sensitive to degradation by Streptomyces hyaluronylticus . HA coat formation 
can also be determined by fixation of the cells in the presence of 
cetyltrimethylammonium bromine (CTAB), followed by immunohistochemical 
staining with purified HA binding domain conjugated to biotin. 

1 5 These in vitro tests can be extended to in vivo models in small animals 

(e.g., rats, mice), in which viral vectors containing cDNAs encoding Has2 and/or 
Has3, or purified, recombinant Has2 and/or Has3 polypeptide are introduced into 
the peritoneal cavity. To assess optimal dosing, two approaches are envisioned. 
First, to optimize the production of HA by the peritoneal surface, extensive 

20 peritoneal lavage to remove free HA is performed. The HA can be quantified, 
using methods outlined herein. Then, fixation in situ using CTAB containing 
fixative, followed by staining for HA with biotinylated HA binding domain is 
employed to show cell surface HA. Optimal dosages of viral vectors and7or 
recombinant polypeptide depend upon the specific application (e.g., operative 

25 site, specific surgery) and desired outcome (persistence of HA secretion and 

anti-adhesive properties). The presence or amount of HA on mucosal or serosal 
surfaces in vivo can be determined using labeled proteins containing HA binding 
domains (Ripellino et al., J. Histochem. & Cvtochem.. 1060 (1985); 
Fenderson et al., Different. , 54 , 85 (1993)). Likewise, small molecules, 

30 identified on the basis of their ability to stimulate or inhibit HA secretion in vitro 
can be tested in similar models. 
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The invention will be further described by the following examples. 

Example 1 

cDNA Cloning and Characterization of Mouse Hyaluronan Synthase-2 

The aligned amino acid sequences of HasA, DG42 and NodC were 
5 utilized to prepare primers for a degenerate PCR strategy to identify a 

HasA/DG42 related cDNA in the mouse. Three degenerate primer pools for RT- 
PCR were prepared, two of which were predicted to anneal to the antisense 
strand, and one of which was predicted to anneal to the sense strand of a putative 
eukaryotic DNA molecule which encodes HA synthase. The oligonucleotides 

10 were made corresponding to the peptide sequences AFNVERACQ (SEQ ID 

NO: 14), GDDRHLTN (SEQ ID NO: 15), and QQTRWTKSYF (SEQ ID NO: 16), 
and had the following degenerate nucleotide sequences: DEG 1 primer, 5'-GCN 
TTY AAY GTN GAR MGN GCN TOY CA 3' (SEQ ID NO: 17, sense strand), 
DEG 3 primer, S'-RTT NGT NAR RTG NCK RTC RTC NCC-3' (SEQ ID 

15 NO: 1 8, antisense strand), and DEG 5 primer, 5'-RAA RTA NSW YTT NGT 
CCA NCK NGT YTG YTG-3' (SEQ ID NO: 19, antisense strand). 

A degenerate primer pool made to the peptide sequence QQTRWTKSYF 
(SEQ ID NO: 16, DEG 5) was utilized for the first-strand synthesis. RNA was 
isolated using TRIZOL™ reagent (GIBCO-BRL/Life Technologies, 

20 Gaithersburg, MD) according to the manufacturer's directions. Reverse 

transcription reactions were performed on total RNA isolated from 10.5 and 14.5 
days post coitum (dpc) C57BL/6J mouse embryos. Briefly, 5 |ig of total RNA 
were heat-denatured at 95 °C then split into two separate reactions. One reaction 
served as a control and amplified a fragment of 28S ribosomal RNA. The 

25 second reaction received one of two degenerate primer pools at a final 

concentration of 2 jiM. Reverse-transcription was carried out at 42 °C using 10 
units M-MuLV reverse transcriptase (Boehringer Mannheim, Indianapolis, IN) 
in a total volume of 25 

Five microliters of each resultant first-strand cDNA were amplified in 

30 separate 100 jal PCR reactions using combinations of degenerate primer pools 
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1 and 3 (DEG 1/3) or 1 and 5 (DEG 1/5). Amplification conditions were as 
follows: 35 cycles of 94°C for 1 minute, 50°C for 1 minute, 72°C for 1 minute, 
followed by a final extension of 72 °C for 10 minutes. Primer pools were used at 
a final concentration of 1 |iM. Twenty microliters of each PCR reaction was 
5 separated through a 2.0% agarose gel (Figure 1). All consistently amplified 
products (see arrows in Figure 1) were gel-purified and cloned directly into a 
pBluescript KSII+ (Stratagene Cloning Systems, La Jolla, CA) T-vector 
prepared as described by Marchuk et al. f Nucleic Acids Re<; r . 1 154 (1991)). 
The resultant plasmids were subjected to restriction endonuc lease and dideoxv 

10 sequencing of double-stranded plasmid DNAs using a Sequenase Version 2.0 
sequencing kit (United States Biochemical Corp, Cleveland, OH). 

The 300 bp DEG 1/5 product (MHas300) and the 180 bp DEG 1/3 
product were related by a common internal site for the restriction endonuclease 
EcoRI, as shown below the gel image in Figure 1 . Sequence analysis of the 

1 5 other consistently amplified PCR products indicated that they were unrelated to 
mouse HAS (Itano et al., J. Biol. Chem. 221, 9875 (1996)) hasA, DG42, nodC, 
and the 1 80 bp and 300 bp PCR products. 

The 300 bp cDNA fragment, MHas300 was utilized as a probe to screen 
a primary A.gtl0 cDNA library constructed from 8.5 dpc C57BL/6J polyA+ RNA 

20 (kindly provided by Dr. J. J. Lee, Mayo Clinic Scottsdale). The probe was 
labeled to high specific activity using random-priming in the presence of 
[« 32 P]dCTP (Feinberg et al., Anal. Biochem.. H2» 6 (1984)). Approximately 
1.5 x 10 6 plaque-forming units (pfus) were screened using standard procedures 
(Sambrook et al., Molecular Cloning: A Laboratory Manual . Cold Spring Harbor 

25 (1989)). Double positive plaques were identified and taken through two 

additional rounds of plaque-purification. In addition, a portion of each primary 
plaque was screened by PCR, employing a combination of primers that flanked 
the AgtlO cloning site and MHas2 specific primers, to determine insert size 
relative to the MHas300 cDNA fragment. Fourteen positive clones were obtained 

30 and analyzed. The mouse X cDNA library yielded multiple overlapping clones, 
which collectively spanned approximately 3 kb (Figure 2). EcoRI restriction 
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fragments were then subcloned into pBluescript KSII+ for sequence analysis. 
The nucleotide sequence of both strands was determined using synthetic 
oligonucleotide primers made to the mouse Has2 sequence and to the vector. 

Sequence analyses identified an open reading frame (ORF) of 1656 bps, 
5 flanked by 5' and 3' untranslated regions (UTRs) of 507 and 772 bps, 

respectively (Figure 3, SEQ ID NO: 1 ). The open reading frame predicted a 63 
kDa protein with several transmembrane sequences, multiple consensus 
phosphorylation sites, and four putative hyaluronan binding motifs. The 
predicted translation initiation site conformed to the Kozak consensus for 

10 initiation (Kozak, Nucleic Acids Res. . 12 . 857 (1984)). Although there were four 
additional upstream ATGs within the 5 f UTR, none of these fitted the Kozak 
consensus and all were followed closely by in-frame stop codons. The presence 
of several upstream ATGs has, however, been more commonly described in 
oncogenic sequences (Kozak, Nucleic Acids Res., 15 , 8125 (1987)). The 3' UTR 

15 contained two consensus sequences for polyadenylation, a CA repeat and a TA 
repeat (Figure 3). 

Database searches indicated that the predicted amino acid sequence of 
mouse Has2 (SEQ ID NO:2) aligned most significantly with Xenopus DG42 
(SEQ ID NO: 10; 56% identity, 70% similarity; Rosa et al., Dev. Biol. . 129 . 1 14 

20 (1987)), Streptococcal HasA (SEQ ID NO:l 1; 21% identity, 28% similarity; 

DeAngelis et al., J. Biol. Chem. . 268 . 19181 (1993)), Rhizobium sp. NodC (SEQ 
ID NO: 12; Jacobs et al., J. BacterioL , 162 . 469 (1 985); Collins-Emerson et al., 
Nucleic Acids Res. . 18 . 6690 (1 990)), and Saccharomyces cerevisiae chitin 
synthase 2 (Chs2) (SEQ ID NO: 13; Bulawa, Mol. Cell. Biol. . XL 1764 (1992)) 

25 (Figure 5). In addition, mouse Has2 displayed 55% identity and 73% similarity 
to the recently reported mouse Hasl gene (SEQ ID NO: 1 1 , Itano et al., J. Biol. 
Chem. . 271 . 9875 (1996)), and the human homologue of this gene (Yang et al., 
EMBO J. . 13 « 286 (1994)). Surprisingly, the deduced amino acid sequence of 
the cDNA of Itano et al. is distinct from the Has2 cDNA described hereinbelow, 

30 although the sequences are clearly related. 
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Recently isolated clones for a second human Has gene, which shares 
greater than 90% amino acid identity to mouse Has2 and thus is predicted to 
represent the human Has2 gene have also been obtained (SEQ ID NO:23). This 
suggests that there are at least two related Has genes in both mouse and humans, 
5 Investigation of the primary amino acid sequence of mouse Has2 

identified several potential transmembrane sequences (Figure 4), four potential 
HA binding motifs fitting the B(X 7 )B consensus (Yang et al., EMBO J. JJ, 286 
(1994)), and numerous consensus sequences for phosphorylation by protein 
kinase C (PKC) and cyclic-AMP dependent kinases, such as protein kinase A 

10 (PKA) (Person et al., In: Protein Phosphorylatio n: A Practical Approach 

(Hardie, D. G., ed), IRL Press at Oxford University Press, Oxford (1993)). Has2 
is predicted to be a multiple membrane-spanning protein with a large 
cytoplasmic loop, similar to the predicted structure of Streptococcus HasA and 
mouse HAS (Hasl) (Figure 6B). Sequence alignment of Has2 with 

1 5 Saccharomyces cerevisiae Chitin synthase2 (Chs2; SEQ ID NO: 13) (Figure 5) 
demonstrated that the residues recently shown to be required for catalytic activity 
in Chs2 (Nagahashi et al., J. Biol. Chem. . 22Q, 13961 (1995)) are conserved 
within the large predicted cytoplasmic loop of mouse Has2 (Figure 6B). It has 
been suggested that these catalytic residues may be generally conserved within 

20 glycosyltransferases that catalyze the synthesis of oligosaccharides with p 1-4 
linkages (Nagahashi et al., supra). Significantly, the predicted cytoplasmic loop 
of the Has2 molecule is the most highly conserved across species, and thus this 
part of the protein may form the catalytic domain. 

Example 2 

25 Molecular Biochemical Characterization of Mouse Has2 

Northern and Southern Analysis. Mouse multiple tissue Northern (MTN) 
Blots (CLONTECH, Palo Alto, CA) were hybridized to a [« 32 P]dCTP-Iabeled 
cDNA probe corresponding to the 1 .65 kb open-reading- frame (ORF) of the 
mouse Has2 gene. Blots were hybridized at 42 °C and washed to high stringency 

30 according to the manufacturer's recommendations. The mouse embryo blot was 
exposed overnight at -70°C to BioMax MR film (Eastman Kodak Company, 
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New Haven, CT) with two intensifying screens, whereas the adult tissue blot was 
exposed for six days at -70 °C with two screens. To control for variation in 
loading, both blots were stripped, and rehybridized with a mouse 
glyceraldehyde-3-phosphate dehydrogenase (GAPDH) probe. Both GAPDH 
5 hybridized blots were exposed for one hour at 
-70°C with two screens. 

Northern analyses detected two transcripts of approximately 3.2 kb and 
4.8 kb, respectively, in embryonic samples (Figure 7). Only the 4.8 kb message 
was observed in RNA from adult tissues. The 4,8 kb transcript was expressed at 

10 levels approximately 20 fold higher than the 3.2 kb transcript. High levels of 
expression were observed in the developing mouse embryo, in addition to lower 
levels in adult mouse heart, brain, spleen, lung and skeletal muscle (Figure 6). 
All of the isolated cDNA clones were predicted to form an identical ORF. Thus, 
rather than being the result of alternate splicing, the 4.8 kb transcript most 

15 probably corresponds to a mouse Has2 mRNA with an alternate polyA signal, 
generating a 3' UTR with approximately 1.8 kb of sequence, in addition to that 
reported herein. 

Moreover, the observed expression pattern of mouse Has2, i.e., Has2 
expression was detected in the primitive streak stage embryo (7.5 dpc) and an 

20 increase in Has2 expression in the later embryo, correlates well with the 
previously described expression pattern of HA. HA has previously been 
observed at significant levels starting as early as the egg cylinder stage (5.5 dpc), 
when it is secreted into the expanding yolk cavity. Thus, HA may play a role in 
the formation and expansion of embryonic cavities. From 9.5 dpc, synthesis 

25 increases, and the HA assumes more of a pericellular distribution, rather than 
being primarily associated with fluid-filled spaces. HA is present at high levels 
within the developing vertebral column, the neural crest-derived mesenchyme of 
the craniofacial region, and the heart and smooth muscle throughout the mid- 
gestation embryo. 

30 In the adult, Has2 expression was detected in heart, brain, spleen, lung 

and skeletal muscle, but not in liver or kidney (Figure 7). The level of 
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expression of Has2 was markedly reduced in adult tissues as compared to the 
embryo. 

Mouse 129Sv/J genomic DNA was prepared from tail snips using 
standard procedures. Approximately 15 jig samples of genomic DNA were 
5 digested overnight with restriction endonuc leases, size-separated through 0.8% 
agarose gels, and transferred to Hybond N+ nylon membranes (Amersham, 
Arlington Heights, IL). Membranes were hybridized to a [« 32 P]dCTP-labeled 
cDNA probe corresponding to the 1.65 kb ORF of mouse Has2. Hybridization 
conditions were performed as recommended by the manufacturer. Membranes 
10 were washed to low (1 X SSC + 0.1% SDS at 37°C) and high (0.1 X SSC + 
0.1% SDS at 55 °C) stringency (1 X SSC (saline sodium citrate) is 1 50 mM 
NaCl, 1 5 mM Na citrate) and autoradiography was performed as described 
above. 

The pattern of hybridizing restriction fragments that was observed 

1 5 through Southern analyses was consistent with mouse Has2 being a single copy 
gene within the mouse genome (Figure 8). In addition, the pattern observed in 
digests of total mouse genomic DNA was identical to that observed in equivalent 
digests of recently isolated mouse Has2 genomic clones. Low stringency wash 
conditions failed to identify any further hybridizing fragments including those 

20 fragments corresponding to the related mouse Hasl (Itano et al., supra) gene. 

This suggests that the level of sequence identity (55%) between mouse Has2 and 
mouse Hasl, and possibly other Has-related genes, is not sufficient to permit 
detection through Southern hybridization even at low stringency. Thus, while 
these results preclude the existence of a mouse Has2 pseudogene, they do not 

25 preclude the existence of other genes related to mouse Has2 and mouse Hasl . 

Transfection Studies . To investigate the potential role of mouse Has2 in 
HA biosynthesis, expression constructs were created in the mammalian 
expression vector, pCIneo (Promega Corporation, Madison, WI), Mouse Has2 
ORFs were amplified by PCR, from a template of mouse Has cDNA clone X \ 1 . 1 

30 (Figure 2). PCR primers were designed to create a mouse Has2 cDN A with an 
optimized Kozak consensus A— ATGG, and to contain Smal/Xmal sites at each 



BNGOOdD: «WD_90OO661A2JU> 



WO 98/00551 



PCT/US97/11761 



43 

end suitable for cloning. Primers were as follows: S'-CCCGGGCAAG ATG 
GAT TGT GAG AGG TTT CTA TGT GTC CTG -3' (SEQ ID NO:21 , bps 504 
to 537, Figure 3) and 5'-CCCGGG TCA TAC ATC AAG CAC CAT GTC ATA 
CTG -3' (SEQ ID NO:22, bps 2163 to 2137, Figure 3). Gel-purified PCR 
5 products were cloned directly into a pBluescript KSII+ T-vector for sequence 
verification, prior to subcloning into the Xmal site of pCIneo. 

The mouse Has2 expression vector was co-transfected with a 
cytomegalovirus promoter (CMV) driven p-gal expression vector into COS-1 
(SV40-transformed African green monkey kidney) cells (Gluzman, CelL 23 , 175 

10 ( 1 98 1 )) using Lipofectamine™ (GIBCO-BRL/Life Technologies, Gaithersburg, 
MD), according to the manufacturer's instructions. The P-gal expression plasmid 
was used in all transfections to permit the visual identification of cells that had 
been successfully transfected. Control co-transfections were pCIneo (vector 
control) and LacZ vector. Cells were analyzed 36 hours after lipofection 

15 (transient transfection). The COS-1 cell line and the mouse 3T6 (Swiss 

embryonic fibroblast) cell line were routinely maintained at 37°C in Dulbecco's 
modified Eagle's medium (DMEM) supplemented with 10% fetal bovine serum 
(FBS) and 2 mM L-glutamine, in a humidified chamber at 5% C0 2 . 

HA Coat Assays. Glutaraldehyde fixed horse erythrocytes (Sigma 

20 Chemical Company, St. Louis, MO) were reconstituted in phosphate-buffered 
saline (PBS), washed several times to remove traces of sodium azide, and finally 
resuspended in PBS plus 1 mg/ml BSA to a density of 5 x 10 8 cells/ml. HA 
coats were visualized around live cells growing in individual wells of a 24-well 
plate or 6- well plate by adding 1 x 10 7 or 5 x 10 7 red blood cells, respectively, to 

25 the growth medium. Red cells were allowed to settle for 1 5 minutes before HA 
coats were scored. To confirm the coats as being composed of HA, red cells 
were removed by extensive washing with PBS, and one well of each 
experimental sample was treated with 10 units/ml bovine testicular 
hyaluronidase (CALBIOCHEM, San Diego, CA) or 5 units/ml Streptomyces 

30 hyaluronidase (CALBIOCHEM, San Diego, CA) in DMEM plus 0.5% FBS for 
1 hour at 37°C. Equivalent wells were incubated under the same conditions in 
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the absence of hyaluronidase. After incubation, red cells were added to the 
wells, as previously described, and coats were again scored. HA coats were 
imaged at 200x magnification. After imaging, red cells were removed by 
extensive washing with PBS. Cells were stained to detect P-galactosidase 
5 (LacZ) activity and imaged as described by Sanes et al. EMBO J. _ 5,3133 
(1986). 

Parental, untransfected COS-1 cells had no detectable coat-forming 
ability in HA pericellular coat-forming assays (Figure 9B). In contrast, 
untransfected 3T6 mouse embryonic fibroblast cells had well-developed HA 

10 coats (Figure 9A). Transient co-transfection of mouse Has2 and LacZ 

expression constructs into COS-1 cells resulted in the production of large HA 
coats (Figure 9D-I). Cells acquiring an HA coat also stained positively for P~gal 
activity (Figure 9D-I), confirming that cells that had generated HA coats had 
successfully taken up DNA. HA coats were destroyed by treatment with 

1 5 Streptomyces hyaluronidase (Figure 9H) or bovine testicular hyaluronidase. 
Control pCIneo transfected cells produced no coats (Figure 9C), and were 
indistinguishable from parental untransfected COS-1 cells. Equivalent numbers 
of LacZ positive cells were observed in experimental and control transfections. 
These results indicate that parental COS-1 cells express all other factors 

20 required for HA biosynthesis and pericellular coat formation, but most likely 

lack HA synthase activity. Thus, expression of Has2 in COS-1 cells is sufficient 

for HA coat formation. 

Discussion 

Residues demonstrated to be critical in terms of the P 1 -4 
25 glycosyltransferase activity of yeast Chs2 were conserved in mouse Has2, mouse 
Hasl, Streptococcal HasA, Xenopus DG42 and Rhizobium NodC. Thus, it is 
likely that mouse Has proteins have p 1-4 glycosyltransferase activity. 
Furthermore, although overall sequence identity between mouse Has2 and 
Streptococcus pyogenes HasA was only 21%, a 180 amino acid region within the 
30 predicted intracellular loop (residues 182 to 361) was highly conserved. This 
region exhibited 54% similarity between mouse Has2 and bacterial HasA, and 
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greater than 80% similarity between mouse Has2, mouse Hasl, and Xenopus 
DG42. This level of sequence conservation suggests that these proteins are 
functionally related. 

Sequence analyses predicted that mouse Has2 and Has3 encode a 
5 membrane protein with multiple transmembrane domains, similar in structure to 
the bacterial HasA protein and mouse Hasl. Significantly, four consensus 
binding sites for HA were identified in Has2, three of which were predicted to be 
intracellular. These sites may thus represent areas of potential binding of HA 
chains during elongation, and/or may represent sites at which the newly 

1 0 synthesized HA polymer remains attached prior to release from the cell. In 
addition to putative HA binding sites, numerous consensus sequences for 
phosphorylation by PKC and cAMP-dependent kinases were identified within 
the predicted intracellular loop of the molecule. This is significant, as 
mammalian HA biosynthesis has been shown to be dependent on activation by 

15 PKC, and suggests that the PKC dependence may partly involve direct activation 
of Has2 through phosphorylation. 

HA-dependent pericellular coats have been proposed to form through two 
alternate mechanisms. The first mechanism is HA receptor-dependent and HA 
synthesis independent. This type of coat can form through association of HA 

20 with cell surface HA receptors, and stabilization of the coat by association of HA 
binding proteoglycans, such as aggrecan and link protein (Lee et al M J. Cell 
Biol. . 123 . 1 899 (1993); Knudson et al., Proc. Natl. Acad. Sci. USA . 90, 4003 
(1993)). Presumably, this permits cells expressing HA receptors to enter an 
environment rich in HA, and to organize an HA matrix around themselves that is 

25 independent of the ability to synthesize HA. 

The second mechanism is HA receptor independent, and requires the 
synthesis and extrusion of HA through the plasma membrane. It has been 
proposed that the extruded HA associates with the membrane through continued 
attachment to the synthase, and that this coat is stabilized by HA-HA and HA- 

30 protein bridges (Heldin et al., Exp. Cell Res. . 208, 422 ( 1 993)). 
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Expression of mouse Has2 by COS-1 cells resulted in the formation of 
large well-pronounced HA coats, as determined by a particle exclusion assay 
(Figure 9). Previous studies in COS cells have shown that transfection of the 
HA receptor, CD44, and the addition of exogenous HA (15 (ig/ml) and 
5 proteoglycans to the medium was required for HA-dependent pericellular matrix 
formation (Knudson et al., Proc. NatL Acad. Sci. IJSA T 2Q, 4003 (1993)). In 
contrast, the studies described hereinabove demonstrate that expression of mouse 
Has2 in COS cells, in the absence of HA receptor expression, exogenously added 
HA, or proteoglycans, was sufficient for HA coat formation. This suggests that 

1 0 Has2 expression leads to the synthesis of HA, which is extruded through the 
plasma membrane and may associate with the cell surface to form an HA coat 
through continued attachment to the synthase. In this respect, the consensus HA 
binding motifs predicted within mouse Has2 may play an important role. 

HA biosynthesis requires two enzyme activities; the transfer of UDP-N- 

1 5 acetylglucosamine (UDP-GlcNAc) and UDP-glucuronic acid (UDP-GlcU A), 
respectively, to the growing HA chain (Philipson et al., Biochemistry 24 . 7899 
(1985)). In S. pyogenes, a single enzyme, HasA, carries out both activities. In 
contrast, recombinant Xenopus DG42 protein can synthesize short chitin 
oligomers from UDP-GlcNAc in vitro, but cannot synthesize a hyaluronan chain 

20 in the presence of UDP-GlcNAc and UDP-GlcUA (Semino et al., Proc. Natl. 
Acad. Sci. USA . 22, 3498 (1995)). This suggests that eukaryotic HA synthesis 
requires DG42-like activity and a second enzyme activity provided by a separate 
protein. 

Example 3 

25 cDNA Cloning and Characterization of Human Hyaluronan Synthase-2 and 

Mouse and Human Hyaluronan Synth ase-3 

Using degenerate PCR primer pair DEG 1 and DEG 5, described in 

Example 1, PCR products of approximately 300 bp were amplified from human 

and mouse total genomic DNA. The templates for PCR were 100 ng of human 

30 T47D mammary carcinoma cell line genomic DNA, and 100 ng of mouse 129 

Sv/J genomic DNA. Cycling parameters were as follows: 35 cycles of 94°C for 

10 seconds, 50°C for 30 seconds, and 72°C for 1 minute, followed by a final 



BNSOOCID: <WO_pe00661A2JL> 



WO 98/00551 



PCT/US97/I176I 



47 

extension step at 72°C for 10 minutes. Amplified fragments of the expected size 
were identified through agarose gel electrophoresis, gel-purified, and cloned 
directly as described in Example 1 . 

Two additional degenerate oligonucleotide primer pools (DEG 10 and 
5 DEG 1 1 ) were designed, based upon the conserved amino acid sequences 
GWGTSGRK (SEQ ID NO:20) and RWLNQQTRW (SEQ ID NO:33) (see 
Figure 14). Similar PGR conditions were used to amplify fragments of the 
expected size from human and mouse genomic DNA using these degenerate 
primers. Amplified PCR products were gel-purified and ligated directly into a 

10 cloning vector for sequence analyses. 

Sequences obtained from the clones fell into two groups in both the 
mouse and human. One group of human clones, represented by SEQ ID NO:23, 
shared 88% sequence identity with the equivalent region of mouse Has2 (SEQ 
ID NO:l) (Figure 10C), and was 100% identical at the amino acid level to SEQ 

15 ID NO:2 (Figure 10D). Thus, SEQ ID NO:23 represents a partial nucleotide 
sequence of human Has2. A human fetal lung expressed sequence tag (EST) 
(Genbank Accession No. W21505) shares approximately 90% nucleotide 
sequence identity with SEQ ID NO:l, and close to 100% amino acid identity to 
the predicted carboxy-terminal end of SEQ ID NO:2. 

20 The second group of clones obtained through degenerate PCR, although 

clearly related to Has2 and Hasl, were unique. The genes present in these clones 
has been designated Has3 (Figure 1 1). The mouse and human Has3 genes share 
93% nucleotide identity (SEQ ID Nos. 26 and 25, respectively) and 99% amino 
acid identity (SEQ ID Nos. 28 and 27, respectively). 

25 Based upon the sequence of these partial fragments, a single pair of 

oligonucleotide primers, forward 5'-TAC TGG ATG GCT TTC AAC GTG 
GAG-3' (corresponding to nucleotides 790 to 813, SEQ ID NO:34, Figure 12B), 
and reverse 5'-GTC ATC CAG AGG TGG TGC TTA TGG-3 ' (corresponding to 
antisense complement of nucleotides 1 142 to 1119, SEQ ID NO:37, Figure 12B) 

30 were employed to facilitate PCR screening of a mouse 129Sv PI genomic library 
(Genome Systems, St. Louis, MO). Three positive Pt clones were obtained. 
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The restriction fragments spanning the entire mouse Has3 gene were identified, 
the inserts comprising the fragments subcloned into pBluescript (Stratagene, La 
Jolla, CA) based vectors and the inserts sequenced. 

To confirm the sequence obtained from the analysis of genomic clones, 
5 the Has 3 cDNA was obtained. The cDNA was cloned by reverse-transcriptase 
polymerase chain reaction (RT-PCR) amplification. The template for the 
reaction was total RNA from late gestation (17.5 days post-coitum) mouse 
C57BL/6J embryos. First-strand cDNA synthesis was performed as described in 
Example 1 using the mouse Has3 reverse oligonucleotide primer. 

10 First-strand cDNAs were PCR amplified using standard PCR buffer 

conditions supplemented with 2% deionized formamide, through 35 cycles of 
94°C for 10 seconds, 65°C for 30 seconds, and 72°C for 2 minutes, followed by 
a final extension step of 72°C for 1 0 minutes. Oligonucleotide primers 
possessed EcoRI restriction endonuclease sites (underlined) at their 5 ' termini to 

15 facilitate subsequent cloning steps. These oligonucleotides included: forward, 
V-CC GAATTCA AG ATG GCG GTG CAG CTG ACT AC A Gee- 
s' (corresponding to nucleotides 1 to 24, SEQ ID NO:38, Figure 12B), and 
reverse, 5' CC GAATTC TCA CAC CTC CGC AAA AGC CAG GC- 
3 '(corresponding to the antisense complement of nucleotides 1665 to 1643, SEQ 

20 ID NO:39, Figure 12B). Amplified cDNAs of the expected size were gel- 
purified and cloned. All sequence analyses were performed using the Genetics 
Computer Group (GCG) package, and MacVector programs. 

The open reading frame (ORF) encoding mouse Has3 is 1662 bp (SEQ 
ID NO:31) (Figure 12B). This ORF encodes a polypeptide of 554 amino acids 

25 (SEQ ID NO:32) with a predicted molecular mass of 63.3 kDa. This polypeptide 
is only 2 amino acids longer than the mouse Has2 polypeptide. Sequence 
alignments indicated that mouse Has3 is 71%, 57%, 56%, and 28% identical to 
mouse Has2, mouse Hasl (HAS protein), Xenopus DG42, and Streptococcus 
pyogenes HasA, respectively (Figure 13 A). Like Hasl and Has2, residues 

30 demonstrated to be critical for N-acetylglucosaminyltransferase activity of yeast 
chitin synthase 2 are completely conserved. In addition, these residues are 
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conserved with members of a recently identified putative plant cellulose synthase 
family (Pear et al.. Proc. Natl. Acad. Sci. USA . 93. 12637(1996)) (Figure 13B). 

Alignment of the partial sequence of human has3 (HAS3 hereinafter) and 
mouse Has3 (Has3 hereinafter) indicated a very high level of sequence 
5 conservation (99%) (Figure 12A). This is similar to the high level of 

conservation observed for human and mouse HAS1 (96%) and HAS2 (99%). 

Hydrophilicity plots suggested that Has3 is very similar in structure to 
Has2 and Hasl, and predicted the presence of multiple transmembrane domains, 
with two at the N-terminus and a cluster at the C-terminus (Figure 14C). 
10 Significantly, like Has2 and Hasl, the Has3 sequence predicts the presence of 
several potential HA binding motifs defined by the consensus B (X 7 )B 
(underlined in Figure 12B). Furthermore, these motifs are located at similar 
positions within the Has3 polypeptide. 

Example 4 

1 5 Molecular Biochemical Characterization of Mouse Has3 

Northern Analysis . To determine the temporal expression pattern of 
mouse Has3 in the developing mouse embryo, Northern blot analysis was 
employed. The mouse Has3 ORF cDNA was labeled with [a 32 P]dCTP by 
random priming (Feinberg and Vogelstein, Anal. Biochem. . 132 . 6 (1984)) and 

20 hybridized to a Northern blot of mouse embryo messenger RNA (CLONTECH, 
Palo Alto, CA) under conditions recommended by the manufacturer. The results 
showed that, in contrast to mouse Has2 which is highly expressed from as early 
as day 7.5 post-coitum through late gestation in the developing mouse embryo, 
mouse Has3 is expressed predominantly in the late gestation embryo (Figure 1 3). 

25 One major transcript of approximately 6.0-6.5 kb and a minor transcript of 
approximately 4.0 kb were observed (Figure 13). 

Transfection Studies . The mouse Has3 ORF was cloned into the EcoRI 
site of the expression vector pCIneo (Promega, Madison, WI). To test the 
enzyme activity of mouse Has3, the mouse Has3 expression vector was co- 

30 transfected with a pCMV p-gal vector into COS-1 (SV40-transformed African 
green monkey kidney) cells using LipofectAMINE™ (Life Technologies Inc., 
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Gaithersburg, MD), according to the manufacturer's instructions. Positive 
control transfections utilized the mouse Has2 expression vector described above. 
HA coat assays and detection of p-galactosidase activity were performed as 
described in Example 2. 
5 pCIneo (vector only control) transfected cells failed to produce coats 

(Figure 15B). Mouse Has3 transfected cells produced pericellular coats that 
were destroyed by treatment with a specific hyaluronidase from Streptomyces 
(5 TRU/ml for 1 hour at 37°C) (compare panels E, before hyaluronidase 
treatment, and F, after hyaluronidase treatment, in Figure 15). In contrast, 
10 pericellular coats remained on mock hyaluronidase treated cells (compare panels 
C, before, and D, after mock hyaluronidase treatment in Figure 15). Thus, the 
data showed that expression of mouse Has3 in COS-1 cells resulted in the 
generation of well-pronounced HA-dependent pericellular coats, as previously 
observed for Has 2. 

15 To confirm the HA biosynthetic capability of Has3 transfected cells, HA 

synthase assays were performed on crude membranes prepared from these cells. 
Crude cell membrane preparations were isolated as described by Becq et al. 
f Proc. Natl. Acad. Sci. USA. 21, 9160 (1994)), except the final membrane pellets 
were resuspended in 50 \x\ of lysis buffer (LB) consisting of 10 mM KC1, 1 .5 

20 mM MgCI 2 , and 10 mM Tris-HCl pH 7,4 plus protease inhibitors (aprotinin, 
leupeptin and phenylmethylsulfonyl fluoride) (LB+). Protein content of crude 
membrane preparations was determined by a BCA assay (Pierce, Rockford, IL). 
To detect HA synthase activity, duplicate samples of approximately 100 \xg 
crude membrane protein were incubated overnight at 37°C in a total reaction 

25 volume of 200 jil under the following conditions: 5 mM dithiothreitol, 1 5 mM 
MgCl 2 , 25 mM HEPES pH 7.1, 1 mM UDP-GlcNAc, 0.05 mM UDP-GlcUA, 
0.4 \ig aprotinin, 0.4 |ig leupeptin, 0.5 \xd UDP-[ I4 C]GlcUA (ICN, Costa Mesa, 
CA). An additional specificity control reaction was set up in which UDP- 
GlcNAc was omitted. After overnight incubation, samples were boiled for 10 

30 minutes, and subsequently divided in two equal portions. Streptomyces 
hyaluronidase (1 turbidity reducing unit (TRU)) was added to one half and 
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incubated for an additional hour at 37°C. SDS was added to a final 
concentration of 1%, samples were boiled and analyzed by descending paper 
chromatography essentially as described in DeAngelis and Weegel, 
Biochemistry . 21, 9033 (1994). 
5 These assays indicated that crude membranes prepared from either Has3 

or Has2 transfected COS-1 cells were capable of converting UDP-[ ,4 C]GlcUA 
into significant amounts of a high molecular weight product only in the presence 
of UDP-GlcNAc (Table 2). Furthermore, this product could be specifically 
degraded by Streptomyces hyaluronidase (Table 2). Thus, in COS-1 cells, Has2 
1 0 and Has3 appear to possess similar enzymatic activities. 

TABLE 2 



Hyaluronan Synthase Activity of Transfected COS-1 Cells 



Vector 


+ UDP-GlcNAc» 


- UDP-GlcNAc 


Hyaluronidase b 


Mouse Has3 


204.2 C 


1.9 d 




pCIneo 


65.0 


2.2 


+ 


Mouse Has2 


26.9 


2.5 




pCIneo 


10.5 


2.0 


+ 


pCIneo (control) 


11.0 


ND e 






10.3 


ND 


-t- 



20 a Plus and minus symbols indicate whether or not UDP-GlcNAc was included 
in these reactions. 

b Plus and minus symbols indicate whether or not a reaction was subsequently 
treated for 1 hour at 37°C with 1 TRU Streptomyces hyaluronidase prior to 
paper chromatography. 
25 c Numbers represent picomoles radiolabeled product formed and were 

calculated taking into account the specific activity of the UDP[ ,4 C]-GlcUA 
used, the amount of cold UDP-GlcUA per reaction, and assumed a 
scintillation counting efficiency of >95%. Based upon these calculations, 1 
picomole of radiolabeled product is represented by 384 disintegrations per 
30 minute (dpm), i.e., 204.2 picomoles product was calculated from 78, 413 
dpm. Numbers represent the mean calculated from duplicate reactions. 
d Number represents the result of a single reaction in each instance. 
c Not determined. 

35 
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Discussion . The three Has proteins are encoded by three separate but 
related genes, which constitute a mammalian HAS gene family. Sequence 
comparisons and structural predictions suggest that the mammalian HAS 
proteins are very similar in structure. They are predicted to have one or two N- 
5 terminal transmembrane domains and a cluster of C-terminal transmembrane 
domains separated by a large cytoplasmic loop. This topology is extraordinarily 
similar to that predicted for the bacterial HA synthase, Has A (Helderman et al„ 
Glvcobiologv. £, 741 (1996)), and to that recently reported for the Rhizobium 
meliloti nodulation factor, NodC (Barny et al., Molec. Microbiol. . 19 . 443 

10 (1 996)). In addition, the mammalian HAS sequences, the Xenopus DG42 

sequence, HasA sequence, NodC sequence, and the recently reported putative 
plant cellulose synthases share critical residues shown to be required for N- 
acetylglucosaminyltransferase activity of yeast chitin synthase 2, making it 
highly likely that all these proteins are functionally related processive P- 

1 5 glycosyltransferases. The highly conserved aspartate residues may represent 
sites such as cation binding sites that in turn may coordinate nucleotide-sugar 
interaction with the enzyme. 

While Semino and Robbins have postulated that DG42 and its related 
mammalian homologs, rather than being bona fide HA synthases, may stimulate 

20 HA production through synthesizing chitin oligosaccharide primers, which are 
required for and rate limiting for eukaryotic HA biosynthesis ( Proc. Natl. Acad. 
Sci. USA . 93 , 4548 (1996)), ceil membranes isolated from baker's yeast, 
Saccharomyces cerevisiae^ engineered to express DG42 have HA synthesis 
activity in vitro when supplied with the required UDP-precursors (DeAngelis 

25 and Achyuthan, J. Biol. Chem. . 27L 23657 (1996)) since S, cerevisiae is 

deficient in UDP-glucuronic acid production, S. cerevisiae is incapable of HA 
biosynthesis. 

Expression of any one of the mammalian HAS proteins in transfected 
mammalian cells leads to a dramatic increase in HA biosynthesis. This would 
30 suggest that the proteins have similar activities. However, the high degree of 
sequence conservation (96-99% identity) between human and mouse HA 
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synthases contrasts with the lower level of identity between synthases within a 
species (Hasl/Has2, 55% identity; Hasl/Has3, 57% identity; Has2/Has3, 71% 
identity), arguing for evolutionary conservation of functionally important 
residues, and for some differences in the mode of action of the three proteins. 
5 Potential differences in function of the proteins could relate to the length of the 
HA chain synthesized, the rate of HA synthesis, the ability to interact with cell- 
type specific accessory proteins, and whether or not the HA is preferentially 
secreted by the cell or alternatively retained by the cell in the form of a 
pericellular coat. 

10 Example S 

Identification of the Chromosomal Location of the Has Genes 
To determine the chromosomal location of the mouse Has genes, a panel 
of DN A samples, from an interspecific cross that has been characterized for over 
2,000 genetic markers throughout the mouse genome, was analyzed. The genetic 

15 markers included in this genetic map span between 50 and 80 centi-Morgans 

(cM) on each mouse autosome and the X chromosome (Chr), and the mapping of 
the reference loci in this interspecific cross are indicated with citations in an 
online database (data can be accessed through the internet as follows: 
http://www.informatics.jax.org/crossdata.html to enter the DNA Mapping Panel 

20 Data Sets from the Mouse Genome Database (MGD), then select the Seldin cross 
and Chromosome). 

Initially, DNAs from two parental mice [C3H/HeJ-g/d and (C3H/HeJ-gW 
x Mus spretus)Fl] were digested with various restriction endonucleases and 
hybridized with probes specific to mouse Hasl, Has2 and Has3 to determine 

25 restriction fragment length variants (RFLVs) to allow haplotype analyses. The 
223 bp mouse Hasl probe was generated through PCR amplification of a full- 
length mouse Hasl cDNA template using oligonucleotide primers, 
5 'GTC AGAGCTACTTCC ACTGTG3 ' (SEQ ID NO:53) and 
5 ' AAGGAGGAGGGCGTCTCCGAG3 ' (SEQ ID NO:54) (nt positions 947-967 

30 and 1 169-1 149, respectively). The mouse Has2 probe was the MHas300 partial 
cDNA (Figure 2), and the mouse Has3 probe was an equivalent fragment of the 



BN8DOCID:<WO ?900661A2JL> 



WO 98/00551 



PCT/US97/11761 



54 

mouse Has3 gene, generated using degenerate PCR primers as described above 
(Example 1). For each gene, informative RFLVs were detected: Has I using 
BamHI restriction endonuclease, C2H/HeJ-#W, 18.0 kb, 6.8 kb; Mus spretus, 2.1 
kb; Has2 using TaqI restriction endonuclease, C3U/HeJ-gld, 3.7 kb; Mus 
5 spretus, 3.9 kb; Has 3 using Mspl restriction endonuclease, C3H/HeJ-g/</, 1 .3 kb, 
4.2 kb; Mus spretus^ 3.2 kb. 

Comparison of the hapotype distribution of the Has RFLVs indicated that 
these genes segregated to three different mouse autosomes; Has! to mouse Chr 
17, Has 2 to mouse Chr 15, and Has 3 to mouse Chr 8. The best gene order ± the 

10 standard deviation (Green, In: Genetics and Probability in Animal Breeding 
Experiments (E. Green, ed.), MacMillan, NY, pp. 77-1 13 (1981)) indicated the 
following gene orders: on mouse Chr 1 7 (centromere) Thbsl - 0.9 cM ± 0.9 cM 
- Has I - 3.5 cM ± 1.7 cM - Hsp84-1\ on mouse Chr 15 (centromere) Dhfr-rsI - 
1 4.0 cM ± 3.3 cM - Has2 - 0.9 cM ± 0.9 cM - Myc; and on mouse Chr 8 

1 5 (centromere) Mil - 5.3 cM ± 2.1 cM - D8MU242 - 0.9 cM ± 0.9 cM - 
Has3/D8MU12 - 1 1 .4 cM ± 3.0 cM - D8MU154. 

Pairwise sequence alignments of mouse Has cDNAs with human HAS 
cDNAs permitted the design of oligonucleotide primer pairs specific for the 
respective human HAS genes. Human HAS1 : HAS IF 

20 5 'GTGCTTCTGTCGCTCTACGCG3 ' (SEQ ID NO:49) and Human HAS 1 R 
5 'CC AGTCCC AATATAGTCC AGACTG3 ' (SEQ ID NO:50) (nt positions 
1410-1431 and 1940-1917, respectively, (Shyjan etal., J. Biol. Chem. . 271 . 
23395 (1996)) which amplified a 520 bp fragment. Human HAS2: HAS2F 
5 'GGTGTGTTCAGTGCATTAGTGGA3 ' (SEQ ID NO:51) and HAS2R 

25 5 TAGCCATCTGAGATATTCTATAGGT3 '(SEQ ID NO: 52) (nt positions 
1359-1382 and 1579-1555, respectively, Watanabe and Yamaguchi, J. Biol. 
Chem. . 271 . 22945 (1996)) which amplified a 220 bp fragment. Human HAS 3: 
HAS3F 5 'TGTGC AGTGTATT AGTGGGCCCT3 ' (SEQ ID NO:41) and 
HAS3R 5'GTTGAGCCACCGGAGGTACTTAG3' (SEQ ID NO:43) which 

30 amplified a 220 bp fragment. Conditions used in all PCR. reactions were: 0.2 
mM each dNTP, 50 mM KCl, 10 mM Tris-HCl pH 8.3, 1.5 mM MgCl 2 , 2% 
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deionized forrnamide, 0.25 U Taq polymerase (Boehringer Mannheim), primers 
at 0.4 |j.M, 1 00 fj.1 reactions. Cycling parameters for each primer pair were as 
follows: 35 cycles of 94°C for 10 seconds, 67°C (HAS1), 63°C (HAS2), or 
65°C (HAS3) for 30 seconds, and 72°C for 1 minute, followed by a final 
5 extension step at 72°C for 10 minutes. 

The oligonucleotide primers were used to screen two somatic cell hybrid 
mapping panels (Coriell Institute, Camden, NJ) segregating human 
chromosomes on a mouse or hamster background. Using this approach, the 
human HAS genes were unequivocally assigned to human Chr 19 (HASJ\ Chr 8 

1 0 (HAS2X and Chr 1 6 (HAS3). 

To refine the location of human HAS I on Chr 19, the PCR fragment 
described above was used as a probe to screen colony filters of a Chr 19 cosmid 
library (Olsen et al., Genomics . 23 . 659 (1994)). Two positive clones, R30674 
and F21 560, were identified, neither of which had been incorporated into any of 

15 the previously assembled contigs constituting the Chr 19 map (Ashworth et al., 
Nat. Genet. . JUL, 422 (1965)). Alu-PCR products (Parrish et aL, Am. J. Hum. 
Genet. . 57(51 267 (1995)) from clone F21560 were hybridized to the cosmid 
library and to a genomic Bacterial Artificial Chromosome (BAC) library 
(Shizuya et al., Proc. Natl. Acad. Sci. USA. 22, 8794 (1992)) to form a contig 

20 around the HAS1 gene. The probe identified several additional cosmids that 
were members of a previously assembled contig (CT1665), which had been in 
situ mapped to I9ql3.3, as well as two BACs (BC79672 and BC56224) which 
extended the HAS1 contig in the opposite direction from CT1665. Alu-PCR 
products from BC56224 were hybridized to cosmids and identified the HAS1 

25 cosmids in addition to numerous clones from another previously assembled 

contig (CT1 03 1 ). Clone D 1 852 from this contig has been incorporated into the 
high resolution pronuclear FISH map of human 19q, placing HAS1 at the ql3.3- 
13.4 boundary, within the approximately 400 kb region between ETFB 
(Electron-Transferring-Flavoprotein, Beta polypeptide) and FPR1 (Formyl 

30 Peptide Receptor 1). EcoRI mapping confirmed the clone overlaps detected by 
hybridization and indicated a size of 286 kb for the extended HAS 1 contig. In 
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addition to the above mapping results, the localization of HAS 1 to Chr 19ql3.3- 
13 4 was confirmed using a 2.1 kb human HAS1 cDNA (Itano et al., BBRC . 
222 . 816 (1996)) and FISH analysis, as described in Inazawa et al. ( Genomes . 
17 , 1 53 (1993)). The mapping results for mouse Has I and human HAS1 
5 reinforce the recently reported relationship between a small region of human 1 9q 
and mouse Chr 17. 

The position of Has2 on proximal mouse Chr 1 5 suggested that the 
human homolog, HAS2 9 is located on the long arm of human Chr 8 at band q24.1 
(DeBry and Seldin, Genomics . 22, 337 (1996) and online database: 

10 http://www3.ncbi.nlm.nih.gov/Homology/). This location corresponds to the 
region predicted to contain the gene for the human Langer-Giedion syndrome 
(LGS) (Chen et al., Genomics . 22, 1 17 (1996)), a contiguous genetic syndrome 
characterized by craniofacial deformities, multiple exostoses, mental retardation, 
microcephaly, and redundant skin (Bauermeister and Letts, Ortho. Rev. . 21 . 31 

1 5 (1 962)). To refine the location of human HAS2 on Chr 8, the human HAS2 
primers were used to PCR screen the following human-hamster somatic cell 
hybrids: CL-17, 3;8/4-i, MC2F, 21q+, and TL/UC (Parrish et al., Som. Cell 
Molec. Genet. . 2SL 143 (1994); Wagner et al., Genomics , ifl, 1 14 (1991)). 
Positive PCR signals were observed for CL-17, 2Iq+ and 3;8/4-l in addition to 

20 total human DNA, sublocalizing the HAS2 gene to the q arm in interval 1-8 
(Spurr et al., Cvtogenet. Cell Genet. . 147 (1995)). Human HAS2 primers 
were further screened against YACs within the distal portion of a large Y AC 
contig (Chen et al., supra). This contig extends from interval 1-1 into interval I- 
9. Only three of the YACs tested were positive, narrowing the location of HAS2 

25 to the overlapping region between these YACs. This places the human HAS2 
gene at human Chr 8q24.12, close to the DAP-A1 gene, and between the defined 
critical region for the Langer-Giedion syndrome (LGS) and the MYC gene. 
Thus, HAS2 can be excluded as a candidate gene for LGS. 

The localization of the mouse Has3 gene to mouse Chr 8 near the 

30 D8MU12 locus implicated human Chr 16q as the most likely location for the 

human homolog of this gene. To confirm and refine this localization YAC DNA 
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pools from a YAC map of human Chr 1 6 (Daggett et al., Natyire . 377CSV 335 
(1995)) were screened with DNA primers that were specific for the human HAS3 
gene, as described above. Three YACs (My782G9, My703C5, and My878A4) 
were identified which produced an amplimer of the correct size with these 
5 primers. These results place the HAS3 gene in band 16q22.1 between the 
somatic cell hybrid breakpoints CY1 27(D) and CY6, and near the E-cadherin 
gene (CDH1) gene and the D16S496 marker. 

All publications and patents are incorporated by reference herein, as 
though individually incorporated by reference. The invention is not limited to 
10 the exact details shown and described, for it should be understood that many 
variations and modifications may be made while remaining within the spirit and 
scope of the invention defined by the claims. 
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SEQUENCE LISTING 

(1) GENERAL. INFORMATION 
(i) APPLICANT: Mayo Foundation for Medical Education and Research 

(ii) TITLE OF THE INVENTION: GENE ENCODING HYALNURONAN 

SYNTHASE 

(iii) NUMBER OF SEQUENCES: 57 

(iv) CORRESPONDENCE ADDRESS: 



(A 
(B 
(C 
(D 
(E 
(F 



(v) COMPUTER READABLE FORM: 



(A 
(B 
(C 
(D 

(vi) 
(A 
(B 
(C 

( vii 
(A 
(B 

(A 
(B 



(vii 
(A 
(B 
(C 

(ix) 
(A 
(B 
(C 



ADDRESSEE: Schwegman, Lundberg, Woessner & Kluth, P. A. 

STREET: P,0. Box 2 938 

CITY: Minneapolis 

STATE: MN 

COUNTRY: U.S. A 

ZIP: 55402 



MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 2.0 

CURRENT APPLICATION DATA: 
APPLICATION NUMBER: 08/812,008 
FILING DATE: 05 -MAR- 1997 
CLASSIFICATION: 

PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 08/812,008 
FILING DATE: 05-MAR-1997 

APPLICATION NUMBER: 08/675,4 99 
FILING DATE: 03-JUL-1996 



) ATTORNEY /AGENT INFORMATION: 
NAME: Embretson, Janet E 
REGISTRATION NUMBER: 39,665 
REFERENCE /DOCKET NUMBER : 150 . 183US1 

TELECOMMUNICATION INFORMATION: 
TELEPHONE : 612-373-6900 
TELEFAX: 612-339-3 061 
TELEX : 



(2) INFORMATION FOR SEQ ID NO : 1 

( i ) S EQUENCE CHARACTER I S TI CS : 
(A) LENGTH: 2947 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ACATGTAAGA AGAAGGAGAA GTCAAGGCGT CTGGAAAGAA TTACCCAGTC CTGGCTTCGA 60 

GCAGCCCATT GAACGGGGGA CTTGAACCAG CCAAAGACTT CTTCATTCTG CTCTTGCTAG 120 

ACTCTG CTG A GTCTTGACCC GGCTTGTAGG TTGATGTGAA AAGAGATTTT GTGTCGTCGG 18 0 

AGGGAAGGGG ATTGGAGCAA ATAGCAAAAC AGGGGGAAAA GTTAATTTAT CTTTAAAGCA 240 

GATATAACAA AGAATTAGAA GACTTAAGTG CAGCGGAAAT ATAAAGAGAA TATTAGTGAA 300 

ATTTCTTCTC AAAGAGGGGA GAACCAAGCA TTTAAGGCTC CCCCATCTTT TTTTTTAAAT 3 60 

GTTGTTTTTA AATTTCTTAT TTTTTTTGGC CGGTCGTCTC AAATTC AT CT GATTTCTTAT 42 0 

TACCTCAATT TTGGAAACTT CCTTCCACGA CCCTCCGGGA CCACACAGAC AGGCGGAGGA 4 80 

CGAGTCTATG AGCAGGAGCT GAACAAGATG CATTGTGAGA GGTTTCTATG TGTCCTGAGA 54 0 

ATAATTGGAA CTACACTTTT TGGAGTGTCT CTCCTCCTCG GAATCACAGC TGCTTATATT 600 

GTTGGCT AC C AGTTTATCCA AACAGATAAT TACTACTTCT CATTTGGACT GTACGGTGCC 660 

TTTTT AG CCT CGCATCTCAT CATCCAAAGC CTCTTTGCCT TTTTGGAACA CCGGAAAATG 72 0 

AAGAAGTCCC TTGAAACCCC GATTAAATTG AACAAAACGG TAGCACTCTG CATCGCTGCG 78 0 

TACCAAGAGG ACCCTGACTA CTTACGGAAA TGTTTGCAAT CTGTGAAAAG GCTGACCTAC 84 0 

CCTGGGATTA AAGTCGTGAT GGTCATCGAT GGGAACTCAG ACGACGACCT TTACATGATG 900 

GACATATTCA GCGAAGTTAT TGGCAGGGAC AAATCGGCCA CGTACATCTG GAAGAACAAC 96 0 

TTTCATGAAA AGGGACCTGG TGAGACAGAA GAGTCCCATA AAGAAAGTTC ACAACATGTC 1020 

ACCCAATTGG TCTTGTCTAA CAAAAGTATT TGCATCATGC AAAAATGGGG TGGAAAGAGA 10 8 0 

GAAGTCATGT ACACAGCCTT CAGAGCACTG GGGCGAAGCG TGGATTATGT ACAGGTGTGT 114 0 

GACTCAGATA CTATGCTTGA CCCTGCCTCA TCTGTGGAGA TGGTGAAGGT CTTAGAGGAA 12 0 0 

GACCCTATGG TTGGAGGTGT TGGAGGAGAT GTCCAGATTT TAAACAAGTA TGATTCCTGG 126 0 

ATCTCCTTCC TCAGCAGCGT GAGATACTGG ATGGCTTTTA ATATAGAAAG GGCCTGCCAG 13 2 0 

TCTTATTTTG GCTGTGTCCA GTGCATAAGC GGTCCTCTGG GAATGTACAG AAACTCCTTG 13 80 

CTGCATGAAT TTGTGGAAGA CTGGTACAAT CAGGAATTCA TGGGTAACCA ATGCAGTTTT 144 0 

GGTGACGACA GGCACCTTAC CAACAGGGTG TTGAGTCTGG GCTATGCAAC TAAATACACG 1500 

GCTCGGTCCA AGTGCCTTAC TGAAACTCCC ATAGAATATC TGAGATGGCT GAACCAGCAG 156 0 

ACCCGATGGA GCAAGTCCTA CTTCCGAGAG TGGCTGTACA ATGCCATGTG GTTTCACAAG 16 2 0 

CATCACCTGT GGATGACCTA TGAAG CTGTT ATCACTGGAT TCTTTCCTTT CTTTCTCATT 16 80 

GCCACAGTCA TCCAGCTCTT CTACAGGGGT AAAATCTGGA ACATCCTCCT CTTCCTGTTA 174 0 

ACTGTCCAGC TAGTGGGTCT CAT CAAGTC A TCTTTTGCCA GCTGCCTTAG AGGAAATATC 18 00 

GTCATGGTAT TCATGTCTCT GTATTCAGTG TTATACATGT CAAGTCTACT TCCTGCCAAG 1860 

ATGTTTGCAA TTGCAACCAT AAACAAAGCT GGGTGGGGCA CATCTGGAAG GAAGACCATT 192 0 

GTTGTTAATT TCATAGGACT TATTCCAGTG TCCGTGTGGT TTACAAT CCT TCTAGGTGGT 198 0 

GTAATTTTCA CCATTTATAA GGAATCTAAA AAGCCATTTT CCGAATCCAA ACAGACTGTT 2 04 0 

CTCATCGTGG GAACTTTGAT CTATGCATGC TACTGGGTCA TGCTTTTGAC TCTCTATGTG 210 0 

GTTCT CATC A ATAAGTGTGG CAGGCGGAAG AAGGGACAAC AGTATGACAT GGTGCTTGAT 2160 

GTATGATGAT GTTTGTAGTC ACACCTGGAG ACACACACAC ACACACATCA CACACACACA 222 0 

CACCTTAGCT CCTCAAGGGG CTATACAGTA TTGTGGCACC GCACCCTGCC ACCACAGGAG 22 8 0 

ACATATCACT GCTGCTGGGA CTTGAACAAA GACATTCAAT GGGGGTTGGT TTCTTTTTTA 2 34 0 

TTCTGCCAAA GCAAATTGAT ACATCAGTGA GAAGAAAGTC CGATTAAATC TGACAGTTTT 24 0 0 

AGGACGGTGG GATGATGTCT TGG CTTATGC ACTTTTCCCT TACTGTGCAT CTGCCTGACA 2460 

GTGTTTGTTC TAAATACCTC ACTTGCCATG CTTTGTGTGG GTGATCATGG AAGAAAAGGA 2520 

TTCTGAAAAC TCAAGGGAAC GTTCTTTCAA CCTACACATC CTAACTTATG GACTCTTTTG 2580 

ATAGCTGATG ATTTTCTTTC TATTTTTTGT TTTTAAGGAA AATTGTTCAT CTTTACCAAA 2 64 0 

TGAAATGCCA AAGGAAAGTT GGAAAGCCAC TGGCTATGCT GTATTTTGAT ATAATAATTG 2700 

TACTGTGTTT TAAATTTTGT ATCCGGATTT TTAAAAACAA AATTTCACAC CATAGTCTAT 2760 

ATTTTACTTC TCTGGCAAAA TACACTTTTG TTCTTTTATA TATATATATA TATATATATA 2 820 
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ATAAAATAGG TTCTAAAAAA ATCCATACTA TAAAAAAAAA TTAACCTGCC CAAAATGTGA 2880 

AACGTGGTTG ACTGATGTTC ATGAAAGAAT AAAATGTTTC TCTCTTTCTC TACATTTTAA 2 94 0 

AAAAAAA 2 94 7 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 552 amino acids 
{B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met His Cys Glu Arg Phe Leu Cys Val Leu Arg lie lie Gly Thr Thr 

15 10 is 

Leu Phe Gly Val Ser Leu Leu Leu Gly lie Thr Ala Ala Tyr lie Val 

20 25 30 

Gly Tyr Gin Phe lie Gin Thr Asp Asn Tyr Tyr Phe Ser Phe Gly Leu 

35 40 45 

Tyr Gly Ala Phe Leu Ala Ser His Leu lie lie Gin Ser Leu Phe Ala 

50 55 60 

Phe Leu Glu His Arg Lys Met Lys Lys Ser Leu Glu Thr Pro lie Lys 
65 70 75 80 

Leu Asn Lys Thr Val Ala Leu Cys lie Ala Ala Tyr Gin Glu Asp Pro 

85 90 95 

Asp Tyr Leu Arg Lys Cys Leu Gin Ser Val Lys Arg Leu Thr Tyr Pro 

100 105 110 

Gly lie Lys Val Val Met Val lie Asp Gly Asn Ser Asp Asp Asp Leu 

115 120 125 

Tyr Met Met Asp lie Phe Ser Glu Val lie Gly Arg Asp Lys Ser Ala 

130 135 140 

Thr Tyr lie Trp Lys Asn Asn Phe His Glu Lys Gly Pro Gly Glu Thr 
145 150 155 160 

Glu Glu Ser His Lys Glu Ser Ser Gin His Val Thr Gin Leu Val Leu 

165 170 175 

Ser Asn Lys Ser lie Cys lie Met Gin Lys Trp Gly Gly Lys Arg Glu 

180 185 190 

Val Met Tyr Thr Ala Phe Arg Ala Leu Gly Arg Ser Val Asp Tyr Val 

195 200 205 

Gin Val Cys Asp Ser Asp Thr Met Leu Asp Pro Ala Ser Ser Val Glu 

210 215 220 

Met Val Lys Val Leu Glu Glu Asp Pro Met Val Gly Gly Val Gly Gly 
225 230 235 240 

Asp Val Gin lie Leu Asn Lys Tyr Asp Ser Trp lie Ser Phe Leu Ser 

245 250 255 

Ser Val Arg Tyr Trp Met Ala Phe Asn lie Glu Arg Ala Cys Gin Ser 

260 265 270 

Tyr Phe Gly Cys Val Gin Cys lie Ser Gly Pro Leu Gly Met Tyr Arg 

275 280 285 

Asn Ser Leu Leu His Glu Phe Val Glu Asp Trp Tyr Asn Gin Glu Phe 

290 295 300 

Met Gly Asn Gin Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn Arg 
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305 310 315 320 

Val Leu Ser Leu Gly Tyr Ala Thr Lys Tyr Thr Ala Arg Ser Lys Cys 

325 330 335 

Leu Thr Glu Thr Pro lie Glu Tyr Leu Arg Trp Leu Asn Gin Gin Thr 

340 345 350 

Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr Asn Ala Met Trp 

355 360 365 

Phe His Lys His His Leu Trp Met Thr Tyr Glu Ala Val lie Thr Gly 

370 375 380 

Phe Phe Pro Phe Phe Leu lie Ala Thr Val lie Gin Leu Phe Tyr Arg 
385 390 395 400 

Gly Lys lie Trp Asn lie Leu Leu Phe Leu Leu Thr Val Gin Leu Val 

405 410 415 

Gly Leu lie Lys Ser Ser Phe Ala Ser Cys Leu Arg Gly Asn lie Val 

420 425 430 

Met Val Phe Met Ser Leu Tyr Ser Val Leu Tyr Met Ser Ser Leu Leu 

435 440 445 

Pro Ala Lys Met Phe Ala lie Ala Thr lie Asn Lys Ala Gly Trp Gly 

450 455 460 

Thr Ser Gly Arg Lys Thr lie Val Val Asn Phe lie Gly Leu lie Pro 
465 470 475 480 

Val Ser Val Trp Phe Thr He Leu Leu Gly Gly Val He Phe Thr He 

485 490 495 

Tyr Lys Glu Ser Lys Lys Pro Phe Ser Glu Ser Lys Gin Thr Val Leu 

500 505 510 

He Val Gly Thr Leu He Tyr Ala Cys Tyr Trp Val Met Leu Leu Thr 

515 520 525 

Leu Tyr Val Val Leu He Asn Lys Cys Gly Arg Arg Lys Lys Gly Gin 

530 535 540 

Gin Tyr Asp Met Val Leu Asp Val 
545 550 



(2) INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 583 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



Met Arg Gin Asp Met Pro Lys Pro Ser Glu Ala Ala Arg Cys Cys Ser 

15 10 15 

Gly Leu Ala Arg Arg Ala Leu Thr He He Phe Ala Leu Leu He Leu 

20 25 30 

Gly Leu Met Thr Trp Ala Tyr Ala Ala Gly Val Pro Leu Ala Ser Asp 

35 40 45 

Arg Tyr Gly Leu Leu Ala Phe Gly Leu Tyr Gly Ala Phe Leu Ser Ala 

50 55 60 

His Leu Val Ala Gin Ser Leu Phe Ala Tyr Leu Glu His Arg Arg Val 
65 70 75 80 

Ala Ala Ala Ala Arg Arg Ser Leu Ala Lys Gly Pro Leu Asp Ala Ala 
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85 90 95 

Thr Ala Arg Ser Val Ala Leu Thr lie Ser Ala Tyr Gin Glu Asp Pro 

100 105 no 

Ala Tyr Leu Arg Gin Cys Leu Thr Ser Ala Arg Ala Leu Leu Tyr Pro 

115 120 125 

His Thr Arg Leu Arg Val Leu Met Val Val Asp Gly Asn Arg Ala Glu 

130 135 140 

Asp Leu Tyr Met Val Asp Met Phe Arg Glu Val Phe Ala Asp Glu Asp 
145 150 155 160 

Pro Ala Thr Tyr Val Trp Asp Gly Asn Tyr His Gin Pro Trp Glu Pro 

165 170 175 

Ala Glu Ala Thr Gly Ala Val Gly Glu Gly Ala Tyr Arg Glu Val Glu 

180 185 190 

Ala Glu Asp Pro Gly Arg Leu Ala Val Glu Ala Leu Val Arg Thr Arg 

195 200 205 

Arg Cys Val Cys Val Ala Gin Arg Trp Gly Gly Lys Arg Glu Val Met 

210 215 220 

Tyr Thr Ala Phe Lys Ala Leu Gly Asp Ser Val Asp Tyr Val Gin Val 
225 230 235 240 

Cys Asp Ser Asp Thr Arg Leu Asp Pro Met Ala Leu Leu Glu Leu Val 

245 250 255 

Arg Val Leu Asp Glu Asp Pro Arg Val Gly Ala Val Gly Gly Asp Val 

260 265 270 

Arg lie Leu Asn Pro Leu Asp Ser Trp Val Ser Phe Leu Ser Ser Leu 

275 280 285 

Arg Tyr Trp Val Ala Phe Asn Val Glu Arg Ala Cys Gin Ser Tyr Phe 

290 295 300 

His Cys Val Ser Cys lie Ser Gly Pro Leu Gly Leu Tyr Arg Asn Asn 
305 310 315 320 

Leu Leu Gin Gin Phe Leu Glu Ala Trp Tyr Asn Gin Lys Phe Leu Gly 

325 330 335 

Thr His Cys Thr Phe Gly Asp Asp Arg His Leu Thr Asn Arg Met Leu 

340 345 350 

Ser Met Gly Tyr Ala Thr Lys Tyr Thr Ser Arg Ser Arg Cys Tyr Ser 

355 360 365 

Glu Thr Pro Ser Ser Phe Leu Arg Trp Leu Ser Gin Gin Thr Arg Trp 

370 375 380 

Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr Asn Ala Leu Trp Trp His 
385 390 395 400 

Arg His His Ala Trp Met Thr Tyr Glu Ala Val Val Ser Gly Leu Phe 

405 410 415 

Pro Phe Phe Val Ala Ala Thr Val Leu Arg Leu Phe Tyr Ala Gly Arg 

420 425 430 

Pro Trp Ala Leu Leu Trp Val Leu Leu Cys Val Gin Gly Val Ala Leu 

435 440 445 

Ala Lys Ala Ala Phe Ala Ala Trp Leu Arg Gly Cys Val Arg Met Val 

450 455 460 

Leu Leu Ser Leu Tyr Ala Pro Leu Tyr Met Cys Gly Leu Leu Pro Ala 
465 470 475 480 

Lys Phe Leu Ala Leu Val Thr Met Asn Gin Ser Gly Trp Gly Thr Ser 

485 490 495 

Gly Arg Lys Lys Leu Ala Ala Asn Tyr Val Pro Val Leu Pro Leu Ala 

500 505 510 

Leu Trp Ala Leu Leu Leu Leu Gly Gly Leu Ala Arg Ser Val Ala Gin 
515 520 525 
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Glu Ala Arg Ala Asp 
530 

His Leu Ala Ala Gly 
545 

Leu Thr lie Tyr Trp 

565 

Gly Gly Tyr Arg Val 

580 



Trp Ser Gly Pro Ser Arg 
535 

Ala Gly Ala Tyr Val Ala 
550 555 
Val Gly Val Arg Arg Leu 

570 

Gin Val 



Ala Ala Glu Ala Tyr 
540 

Tyr Trp Val Val Met 

560 

Cys Arg Arg Arg Ser 

575 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 ine ar 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met Lys Glu Lys Ala Ala Glu Thr Met Glu lie Pro Glu Gly lie Pro 

15 10 15 

Lys Asp Leu Glu Pro Lys His Pro Thr Leu Trp Arg lie lie Tyr Tyr 

20 25 30 

Ser Phe Gly Val Val Leu Leu Ala Thr lie Thr Ala Ala Tyr Val Ala 

35 40 45 

Glu Phe Gin Val Leu Lys His Glu Ala lie Leu Phe Ser Leu Gly Leu 

50 55 60 

Tyr Gly Leu Ala Met Leu Leu His Leu Met Met Gin Ser Leu Phe Ala 
65 70 75 80 

Phe Leu Glu lie Arg Arg Val Asn Lys Ser Glu Leu Pro Cys Ser Phe 

85 90 95 

Lys Lys Thr Val Ala Leu Thr lie Ala Gly Tyr Gin Glu Asn Pro Glu 

100 105 110 

Tyr Leu lie Lys Cys Leu Glu Ser Cys Lys Tyr Val Lys Tyr Pro Lys 

115 120 125 

Asp Lys Leu Lys lie lie Leu Val lie Asp Gly Asn Thr Glu Asp Asp 

130 135 140 

Ala Tyr Met Met Glu Met Phe Lys Asp Val Phe His Gly Glu Asp Val 
145 150 155 160 

Gly Thr Tyr Val Trp Lys Gly Asn Tyr His Thr Val Lys Lys Pro Glu 

165 170 175 

Glu Thr Asn Lys Gly Ser Cys Pro Glu Val Ser Lys Pro Leu Asn Glu 

180 185 190 

Asp Glu Gly lie Asn Met Val Glu Glu Leu Val Arg Asn Lys Arg Cys 

195 200 205 

Val Cys lie Met Gin Gin Trp Gly Lys Arg Glu Val Met Tyr Thr Ala 

210 215 220 

Phe Gin Ala lie Gly Thr Ser Val Asp Tyr Val Gin Val Cys Asp Ser 
225 230 235 240 

Asp Thr Lys Leu Asp Glu Leu Ala Thr Val Glu Met Val Lys Val Leu 

245 250 255 

Glu Ser Asn Asp Met Tyr Gly Ala Val Gly Gly Asp Val Arg lie Leu 

260 265 270 
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Asn Pro Tyr Asp 
275 

Met Ala Phe Asn 

290 

Ser Cys lie Ser 
305 

Val Phe Leu Glu 

Thr Leu Gly Asp 

340 

Tyr Arg Thr Lys 
355 

Ser Leu Tyr Leu 
370 

Tyr Phe Arg Glu 
385 

He Trp Met Thr 

He Thr Ala Thr 

420 

Val Val Trp Leu 
435 

He Tyr Ala Cys 
450 

Leu Tyr Ser Met 
465 

Ala Leu Leu Thr 

Lys lie Val Gly 

500 

Ala Val Leu Cys 
515 

Asn Asp Trp Ser 
530 

Tyr Gly Cys Val 
545 

Tyr Trp Val Trp 

Thr Leu Val His 

580 



Ser Phe He Ser 

280 

Val Glu Arg Ala 
295 

Gly Pro Leu Gly 
310 

Ala Trp Tyr Arg 
325 

Asp Arg His Leu 

Tyr Thr His Lys 

360 

Arg Trp Leu Asn 
375 

Trp Leu Tyr Asn 
390 

Tyr Glu Ser Val 
405 

Val He Arg Leu 

Leu Leu Cys He 

440 

Trp Leu Arg Gly 
455 

Leu Tyr Met Thr 
470 

Leu Asn Lys Thr 
485 

Asn Tyr Met Pro 

Gly Gly Val Gly 

520 

Thr Pro Glu Lys 
535 

Gly Tyr Val Met 
550 

Val Lys Arg Cys 
565 

Asp lie Pro Asp 
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Phe Met Ser Ser 

Cys Gin Ser Tyr 

300 

Met Tyr Arg Asn 
315 

Gin Lys Phe Leu 
330 

Thr Asn Arg Val 
345 

Ser Arg Ala Phe 

Gin Gin Thr Arg 

380 

Ala Gin Trp Trp 
395 

Val Ser Phe He 
410 

He Tyr Ala Gly 
425 

Gin He Met Ser 

Asn Phe He Met 

460 

Gly Leu Leu Pro 
475 

Gly Trp Gly Thr 
490 

lie Leu Pro Leu 
505 

Tyr Ser He Tyr 

Gin Lys Glu Met 

540 

Tyr Met Val He 
555 

Cys Arg Lys Arg 
570 

Met Cys Val 
585 



Leu Arg Tyr Trp 
285 

Phe Asp Cys Val 

Asn He Leu Gin 

320 

Gly Thr Tyr Cys 
335 

Leu Ser Met Gly 
350 

Ser Glu Thr Pro 
365 

Trp Thr Lys Ser 

His Lys His His 

400 

Phe Pro Phe Phe 
415 

Thr He Trp Asn 
430 

Leu Phe Lys Ser 
445 

Leu Leu Met Ser 

Ser Lys Tyr Phe 

480 

Ser Gly Arg Lys 
495 

Ser He Trp Ala 
510 

Met Asp Cys Gin 
525 

Tyr His Leu Leu 

Met Ala Val Met 

560 

Ser Gin Thr Val 
575 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Pro He Phe Lys Lys Thr Leu He Val Leu Ser Phe He Phe Leu 
1 5 10 15 
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He Ser He Leu lie Tyr Leu Asn Met Tyr Leu Phe Gly Thr Ser Thr 

20 25 30 

Val Gly He Tyr Gly Val He Leu He Thr Tyr Leu Val He Lys Leu 

35 40 45 

Gly Leu Ser Phe Leu Tyr Glu Pro Phe Lys Gly Asn Pro His Asp Tyr 

50 55 60 

Lys Val Ala Ala Val He Pro Ser Tyr Asn Glu Asp Ala Glu Ser Leu 
65 70 75 80 

Leu Glu Thr Leu Lys Ser Val Leu Ala Gin Thr Tyr Pro Leu Ser Glu 

85 90 95 

He Tyr He Val Asp Asp Gly Ser Ser Asn Thr Asp Ala He Gin Leu 

100 105 110 

He Glu Glu Tyr Val Asn Arg Glu Val Asp He Cys Arg Asn Val He 

115 120 125 

Val His Arg Ser Leu Val Asn Lys Gly Lys Arg His Ala Gin Ala Trp 

130 135 140 

Ala Phe Glu Arg Ser Asp Ala Asp Val Phe Leu Thr Val Asp Ser Asp 
145 150 155 160 

Thr Tyr He Tyr Pro Asn Ala Leu Glu Glu Leu Leu Lys Ser Phe Asn 

165 170 175 

Asp Glu Thr Val Tyr Ala Ala Thr Gly His Leu Asn Ala Arg Asn Arg 

180 185 190 

Gin Thr Asn Leu Leu Thr Arg Leu Thr Asp He Arg Tyr Asp Asn Ala 

195 200 205 

Phe Gly Val Glu Arg Ala Ala Gin Ser Leu Thr Gly Asn He Leu Val 

210 215 220 

Cys Ser Gly Pro Leu Ser He Tyr Arg Arg Glu Val He He Pro Asn 
225 230 235 240 

Leu Glu Arg Tyr Lys Asn Gin Thr Phe Leu Gly Leu Pro Val Ser He 

245 250 255 

Gly Asp Asp Arg Cys Leu Thr Asn Tyr Ala He Asp Leu Gly Arg Thr 

260 265 270 

Val Tyr Gin Ser Thr Ala Arg Cys Asp Thr Asp Val Pro Phe Gin Leu 

275 280 285 

Lys Ser Tyr Leu Lys Gin Gin Asn Arg Trp Asn Lys Ser Phe Phe Arg 

290 295 300 

Glu Ser lie lie Ser Val Lys Lys He Leu Ser Asn Pro He Val Ala 
305 310 315 320 

Leu Trp Thr He Phe Glu Val Val Met Phe Met Met Leu He Val Ala 

325 330 335 

He Gly Asn Leu Leu Phe Asn Gin Ala He Gin Leu Asp Leu He Lys 

340 345 350 

Leu Phe Ala Phe Leu Ser He He Phe He Val Ala Leu Cys Arg Asn 

355 360 365 

Val His Tyr Met Val Lys His Pro Ala Ser Phe Leu Leu Ser Pro Leu 

370 375 380 

Tyr Gly He Leu His Leu Phe Val Leu Gin Pro Leu Lys Leu Tyr Ser 
385 390 395 400 

Leu Cys Thr He Lys Asn Thr Glu Trp Gly Thr Arg Lys Lys Val Thr 

405 410 415 

He Phe Lys 



(2) INFORMATION FOR SEQ ID NO : 6 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



Met Tyr Leu Leu Asp Thr Thr Ser Thr Ala Ala lie Ser lie Tyr Ala 

15 10 15 

Leu Leu Leu Thr Ala Tyr Arg Ser Met Gin Val Leu Tyr Ala Arg Pro 

20 25 30 

lie Asp Gly Leu Ala Val Ala Ala Glu Pro Val Glu Thr Arg Pro Leu 

35 40 45 

Pro Ala Val Asp Val lie Val Pro Ser Phe Asn Glu Asp Pro Gly lie 

50 55 60 

Leu Ser Ala Cys Leu Ala Ser lie Ala Asp Gin Asp Tyr Pro Gly Glu 
65 70 75 80 

Leu Arg Val Tyr Val Val Asp Asp Gly Ser Arg Asn Arg Glu Ala lie 

85 90 95 

Val Arg Val Arg Ala Phe Tyr Ser Arg Asp Pro Arg Phe Ser Phe lie 

100 105 110 

Leu Leu Pro Glu Asn Val Gly Lys Arg Lys Ala Gin lie Ala Ala lie 

115 120 125 

Gly Gin Ser Ser Gly Asp Leu Val Leu Asn Val Asp Ser Asp Ser Thr 

130 135 140 

lie Ala Phe Asp Val Val Ser Lys Leu Ala Ser Lys Met Arg Asp Pro 
145 150 155 160 

Glu Val Gly Ala Val Met Gly Gin Leu Thr Ala Ser Asn Ser Gly Asp 

165 170 175 

Thr Trp Leu Thr Lys Leu lie Asp Met Glu Tyr Trp Leu Ala Cys Asn 

180 185 190 

Glu Glu Arg Ala Ala Gin Ser Arg Phe Gly Ala Val Met Cys Cys Cys 

195 200 205 

Gly Pro Cys Ala Met Tyr Arg Arg Ser Ala Leu Ala Ser Leu Leu Asp 

210 215 220 

Gin Tyr Glu Thr Gin Leu Phe Arg Gly Lys Pro Ser Asp Phe Gly Glu 
225 230 235 240 

Asp Arg His Leu Thr lie Leu Met Leu Lys Ala Gly Phe Arg Thr Glu 

245 250 255 

Tyr Val Pro Asp Ala lie Val Ala Thr Val Val Pro Asp Thr Leu Lys 

260 265 270 

Pro Tyr Leu Arg Gin Gin Leu Arg Trp Ala Arg Ser Thr Phe Arg Asp 

275 280 285 

Thr Phe Leu Ala Leu Pro Leu Leu Arg Gly Leu Ser Pro Phe Leu Ala 

290 295 300 

Phe Asp Ala Val Gly Gin Asn lie Gly Gin Leu Leu Leu Ala Leu Ser 
305 310 315 320 

Val Val Thr Gly Leu Ala His Leu lie Met Thr Ala Thr Val Pro Trp 

325 330 335 

Trp Thr lie Leu lie lie Ala Cys Met Thr lie lie Arg Cys Ser Val 

340 345 350 

Val Ala Leu His Ala Arg Gin Leu Arg Phe Leu Gly Phe Val Leu His 
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355 360 365 

Thr Pro lie Asn Leu Phe Leu lie Leu Pro Leu Lys Ala Tyr Ala Leu 

370 375 380 

Cys Thr Leu Ser Asn Ser Asp Trp Leu Ser Arg Tyr Ser Ala Pro Glu 
385 390 395 400 

Val Pro Val Ser Gly Gly Lys Gin Thr Pro lie Gin Thr Ser Gly Arg 

405 410 415 

Val Thr Pro Asp Cys Thr Cys Ser Gly Glu 

420 425 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Lys Arg Glu Val Met Tyr Thr Ala Phe Arg Ala Leu Gly Arg Ser Val 

15 10 15 

Asp Tyr Val Gin Val Cys Asp Ser Asp Thr Met Leu Asp Pro Ala Ser 

20 25 30 

Ser Val Glu Met Val Lys Val Leu Glu Glu Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Gin Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn Arg Val Leu Ser 

15 10 15 

Leu Gly Tyr Ala Thr Lys Tyr Thr Ala Arg Ser Lys Cys Leu Thr Glu 

20 25 30 

Thr Pro lie Glu Tyr Leu Arg Trp Leu Asn Gin Gin Thr Arg Trp Ser 

35 40 45 

Lys Ser Tyr Phe Arg Glu Trp 
50 55 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4 3 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asp Ser Val 

1.5 10 15 

Asp Tyr Val Gin Val Cys Asp Ser Asp Thr Arg Leu Asp Pro Met Ala 

20 25 30 

Leu Leu Glu Leu Val Arg Val Leu Asp Glu Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Lys Arg Glu Val Met Tyr Thr Ala Phe Gin Ala lie Gly Thr Ser Val 

15 10 15 

Asp Tyr Val Gin Val Cys Asp Ser Asp Thr Lys Leu Asp Glu Leu Ala 

20 25 30 

Thr Val Glu Met Val Lys Val Leu Glu Ser Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 : 

Lys Arg His Ala Gin Ala Trp Ala Phe Glu Arg Ser Asp Ala Asp Val 

15 10 15 

Phe Leu Thr Val Asp Ser Asp Thr Tyr lie Tyr Pro Asn Ala Leu Glu 

20 25 30 

Glu Leu Leu Lys Ser Phe Asn Asp Glu 
35 40 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Lys Arg Lys Ala Gin lie Ala Ala lie Gly Gin Ser Ser Gly Asp Leu 

15 10 15 

Val Leu Asn Val Asp Ser Asp Ser Thr lie Ala Phe Asp Val Val Ser 

20 25 30 

Lys Leu Ala Ser Lys Met Arg Asp Pro 
35 40 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



Lye Lys Lys lie Asn Ser His Arg Trp Leu Phe Asn Ala Phe Cys Pro 

15 10 15 

Val Leu Gin Pro Thr Val Val Thr Leu Val Asp Val Gly Thr Arg Leu 

20 25 30 

Asn Asn Thr Ala lie Tyr Arg Leu Trp Lys Val Phe Asp Met Asp 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 14: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 



Ala Phe Asn Val Glu Arg Ala Cys Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Asp Asp Arg His Leu Thr Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

<C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Gin Gin Thr Arg Trp Thr Lys Ser Tyr Phe 
15 10 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

( C ) S TRANDEDNE S S : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GCNTTYAAYG TNGARMGNGC NTGYCA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
RTTNGTNARR TGNCKRTCRT CNCC 

(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
RAARTANSWY TTNGTC CANC KNGTYTGYTG 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

Gly Trp Gly Thr Ser Gly Arg Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE : CDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
CCCGGGCAAG ATGGATTGTG AGAGGTTTCT ATGTGTCCTG 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
CCCGGGTCAT ACATCAAGCA CCATGTCATA CTG 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 5 base pairs 

(B) TYPE: nucleic acid 

( C ) S TRANDEDNES S : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GTCTTATTTT GGGTGTGTTC AGTGCATTAG TGGACCTCTG GGAATGTACA GAAACTCCTT 6 0 

GTTGCATGAG TTTGTGGAAG ATTGGTACAA TCAAGAATTT ATGGGCAACC AATGTAGCTT 12 0 

TGGTGATGAC AGGCATCTCA CGAACCGGGT GCTGAGCCTG GGCTATGCAA CAAAATACAC 180 

AGCTCGATCT AAGTG CCTTA CTGAAACACC TATAGAATAT CTCAGATGGC TAAAC 2 35 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) S TRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 24: 

Ser Tyr Phe Gly Cys Val Gin Cys lie Ser Gly Pro Leu Gly Met Tyr 

15 10 15 

Arg Asn Ser Leu Leu Gin Gin Phe Leu Glu Asp Trp Tyr His Gin Lys 

20 25 30 

Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn 

35 40 45 

Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys 

50 55 60 

Cys Leu Thr Glu Thr Pro Thr Lys Tyr Leu Arg Trp Leu Asn 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 235 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GTCCTACTTT GGCTGTGTGC AGTGTATTAG TGGGCCCTTG GGCATGTACC GCAACAGCCT 6 0 

CCTCCAGCAG TTCCTGGAGG ACTGGTACCA TCAGAAGTTC CTAGGCAGCA AGTGCAG CTT 120 
CGGGGATGAC CGGCACCTCA CCAACCGAGT CCTGAGCCTT GGCTACCGAA CTAAGTATAC 180 
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CGCGCGCTCC AAGTGCCTCA CAGAGACCCC CACTAAGTAC CTCCGGTGGC TCAAC 23 5 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 5 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : unknown 
<D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GTCCTACTTT GGCTGTGTGC AATGTATTAG TGGGCCTTTG GGCATGTACC GCAACAGCCT 60 

CCTTC AG CAG TTCCTGGAGG ATTGGTACCA TCAGAAGTTC CTAGGCAGCA AGTGCAGCTT 120 

TGGGGATGAT CGGCACCTTA CCAACCGAGT CCTGAGTCTT GGCTACCGGA CTAAGTATAC 180 

AGCACGCTCT AAGTGCCTCA CAGAGACCCC CACTAGGTAC CTTCGATGGC TCAAT 235 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Ser Tyr Phe Gly Cys Val Gin Cys lie Ser Gly Pro Leu Gly Met Tyr 

15 10 15 

Arg Asn Ser Leu Leu Gin Gin Phe Leu Glu Asp Trp Tyr His Gin Lys 

20 25 30 

Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn 

35 40 45 

Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys 

50 55 60 

Cys Leu Thr Glu Thr Pro Thr Lys Tyr Leu Arg Trp Leu Asn 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
Ser Tyr Phe Gly Cys Val Gin Cys lie Ser Gly Pro Leu Gly Met Tyr 
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15 10 15 

Arg Asn Ser Leu Leu Gin Gin Phe Leu Glu Asp Trp Tyr His Gin Lys 

20 25 30 

Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn 

35 40 45 

Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys 

50 55 60 

Cys Leu Thr Glu Thr Pro Thr Arg Tyr Leu Arg Trp Leu Asn 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



Ser Tyr Phe Gly Cys Val Gin Cys lie Ser Gly Pro Leu Gly Met Tyr 

15 10 15 

Arg Asn Ser Leu Leu Gin Gin Phe Leu Glu Asp Trp Tyr His Gin Lys 

20 25 30 

Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn 

35 40 45 

Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys 

50 55 60 

Cys Leu Thr Glu Thr Pro Thr Lys Tyr Leu Arg Trp Leu Asn Gin Gin 
65 70 75 80 

Thr Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr Asn Ser Leu 

85 90 95 

Trp Phe His Lys His His Leu Trp Met Thr Tyr Glu Ser Val Val Thr 

100 105 110 

Gly Phe Phe Pro Phe Phe Leu He Ala Thr Val He Gin Leu Phe Tyr 

115 120 125 

Arg Gly Arg He Trp Asn He Leu Leu Phe Leu Leu Thr Val Gin Leu 

130 135 140 

Val Gly He He Lys Ala Thr Tyr Ala Cys Phe Leu Arg Gly Asn Ala 
145 150 155 160 

Glu Met He Phe Met Ser Tyr Leu Ser Leu Leu Tyr Met Ser Ser Leu 

165 170 175 

Leu Pro Ala Lys He Phe Ala He Ala Thr He Asn Lys Ser 

180 185 190 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Ser Tyr Phe Gly Cys Val Gin Cys lie Ser Gly Pro Leu Gly Met Tyr 

15 10 15 

Arg Asn Ser Leu Leu Gin Gin Phe Leu Glu Asp Trp Tyr His Gin Lys 

20 25 30 

Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn 

35 40 45 

Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys 

50 55 60 

Cys Leu Thr Glu Thr Pro Thr Arg Tyr Leu Arg Trp Leu Asn Gin Gin 
65 70 75 80 

Thr Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr Asn Ser Leu 

85 90 95 

Trp Phe His Lys His His Leu Trp Met Thr Tyr Glu Ser Val Val Thr 

100 105 110 

Gly Phe Phe Pro Phe Phe Leu lie Ala Thr Val lie Gin Leu Phe Tyr 

115 120 125 

Arg Gly Arg lie Trp Asn lie Leu Leu Phe Leu Leu Thr Val Gin Leu 

130 135 140 

Val Gly He He Lys Ala Thr Tyr Ala Cys Phe Leu Arg Gly Asn Ala 
145 150 155 160 

Glu Met He Phe Met Ser Tyr Leu Ser Leu Leu Tyr Met Ser Ser Leu 

165 170 175 

Leu Pro Ala Lys He Phe Ala He Ala Thr He Asn Lys Ser 

180 185 190 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME / KEY : Coding Sequence 

(B) LOCATION: 1. . .1662 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATG CCG GTG CAG CTG ACT ACA GCC CTG CGT GTG GTG GGC ACC AGT CTG 4 8 

Met Pro Val Gin Leu Thr Thr Ala Leu Arg Val Val Gly Thr Ser Leu 
1 5 10 15 

TTT GCC CTG GTA GTG CTG GGA GGC ATC CTG GCG GCC TAT GTG ACA GGC 96 
Phe Ala Leu Val Val Leu Gly Gly lie Leu Ala Ala Tyr Val Thr Gly 

20 25 30 
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TAC CAG TTT ATC CAC AC A GAA AAG CAC TAC CTG TCC TTT GGC CTC TAC 144 

Tyr Gin Phe He His Thr Glu Lys His Tyr Leu Ser Phe Gly Leu Tyr 
35 40 45 

GGT GCC ATC CTG GGT CTA CAT CTG CTC ATC CAG AGC CTG TTT GCC TTC 192 
Gly Ala He Leu Gly Leu His Leu Leu He Gin Ser Leu Phe Ala Phe 
50 55 60 

CTG GAG CAC CGT CGA ATG CGC AGG GCA GGG CGC CCC CTC AAG CTG CAC 240 
Leu Glu His Arg Arg Met Arg Arg Ala Gly Arg Pro Leu Lys Leu His 
€5 70 75 80 

TGC TCC CAG AGG TCG CGT TCA GTG GCA CTC TGC ATT GCT GCC TAC CAA 2 88 

Cys Ser Gin Arg Ser Arg Ser Val Ala Leu Cys He Ala Ala Tyr Gin 

85 90 95 

GAG GAC CCC GAA TAC CTG CGC AAG TGC CTT CGC TCA GCT CAG CGC ATT 3 36 

Glu Asp Pro Glu Tyr Leu Arg Lys Cys Leu Arg Ser Ala Gin Arg He 

100 105 110 

GCC TTT CCA AAC CTC AAG GTG GTC ATG GTA GTG GAT GGC AAT CGC CAG 3 84 

Ala Phe Pro Asn Leu Lys Val Val Met Val Val Asp Gly Asn Arg Gin 
115 120 125 

GAA GAT ACC TAC ATG TTG GAC ATC TTC CAT GAG GTG CTG GGT GGC ACT 4 32 

Glu Asp Thr Tyr Met Leu Asp He Phe His Glu Val Leu Gly Gly Thr 
130 135 140 

GAG CAA GCT GGC TTC TTT GTG TGG CGT AGC AAT TTC CAT GAG GCG GGT 480 
Glu Gin Ala Gly Phe Phe Val Trp Arg Ser Asn Phe His Glu Ala Gly 
145 150 155 160 

GAA GGA GAG ACA GAG GCC AGC CTG CAG GAA GGC ATG GAG CGT GTG CGA 52 8 

Glu Gly Glu Thr Glu Ala Ser Leu Gin Glu Gly Met Glu Arg Val Arg 

165 170 175 

GCT GTG GTG TGG GCC AGC ACC TTC TCA TGC ATC ATG CAG AAG TGG GGG 5 76 

Ala Val Val Trp Ala Ser Thr Phe Ser Cys He Met Gin Lys Trp Gly 

180 185 190 

GGC AAG CGT GAG GTC ATG TAC ACT GCC TTC AAG GCC CTT GGC AAC TCA 624 
Gly Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asn Ser 
195 200 205 

GTG GAC TAC ATC CAG GTG TGT GAC TCT GAC ACT GTG CTG GAC CCA GCC 672 
Val Asp Tyr He Gin Val Cys Asp Ser Asp Thr Val Leu Asp Pro Ala 
210 215 220 

TGC ACC ATT GAG ATG CTT CGA GTC TTG GAA GAA GAT CCC CAA GTA GGA 72 0 

Cys Thr He Glu Met Leu Arg Val Leu Glu Glu Asp Pro Gin Val Gly 
225 230 235 240 

GGT GTT GGA GGA GAT GTC CAA ATC CTC AAC AAG TAT GAT TCA TGG ATC 76 8 

Gly Val Gly Gly Asp Val Gin He Leu Asn Lys Tyr Asp Ser Trp He 

245 250 255 
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TCC TTC CTG AGC AGT GTG AGG TAC TGG ATG GCT TTC AAC GTG GAG CGG 816 
Ser Phe Leu Ser Ser Val Arg Tyr Trp Met Ala Phe Asn Val Glu Arg 

260 265 270 



GCC TGC CAG TCC TAC TTT GGC TGT GTG CAA TGT ATT AGT GGG CCT TTG 864 
Ala Cys Gin Ser Tyr Phe Gly Cys Val Gin Cys lie Ser Gly Pro Leu 
275 280 285 



GGC ATG TAC CGC AAC AGC CTC CTT CAG CAG TTC CTG GAG GAT TGG TAC 912 

Gly Met Tyr Arg Asn Ser Leu Leu Gin Gin Phe Leu Glu Asp Trp Tyr 

290 295 300 

CAT CAG AAG TTC CTA GGC AGC AAG TGC AGC TTT GGG GAT GAT CGG CAC 960 

His Gin Lys Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His 

305 310 315 320 

CTT ACC AAC CGA GTC CTG AGT CTT GGC TAC CGG ACT AAG TAT ACA GCA 1008 

Leu Thr Asn Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala 

325 330 335 



CGC TCT AAG TGC CTC ACA GAG ACC CCC ACT AGG TAC CTT CGA TGG CTC 1056 
Arg Ser Lys Cys Leu Thr Glu Thr Pro Thr Arg Tyr Leu Arg Trp Leu 

340 345 350 



AAT CAG CAA ACC CGC TGG AGC AAG TCT TAC TTT CGG GAA TGG CTC TAC 1104 

Asn Gin Gin Thr Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr 
355 360 365 

AAT TCT CTG TGG TTC CAT AAG CAC CAC CTC TGG ATG ACC TAT GAA TCA 1152 

Asn Ser Leu Trp Phe His Lys His His Leu Trp Met Thr Tyr Glu Ser 

370 375 380 



GTG GTC ACA GGT TTC TTC CCA TTC TTC CTC ATT GCT ACA GTC ATA CAA 1200 
Val Val Thr Gly Phe Phe Pro Phe Phe Leu lie Ala Thr Val lie Gin 
385 390 395 400 



CTT TTC TAC CGT GGC CGC ATC TGG AAC ATT CTC CTC TTC CTG CTA ACA 1248 

Leu Phe Tyr Arg Gly Arg lie Trp Asn lie Leu Leu Phe Leu Leu Thr 

405 410 415 

GTG CAG CTG GTG GGC ATT ATC AAG GCT ACC TAT GCC TGC TTC CTT CGA 12 96 

Val Gin Leu Val Gly lie lie Lys Ala Thr Tyr Ala Cys Phe Leu Arg 

420 425 430 



GGC AAT GCA GAG ATG ATC TTC ATG TCC CTC TAC TCC CTT CTC TAT ATG 1344 
Gly Asn Ala Glu Met lie Phe Met Ser Leu Tyr Ser Leu Leu Tyr Met 
435 440 445 



TCC AGC CTC TTG CCA GCC 
Ser Ser Leu Leu Pro Ala 
450 



AAG ATC TTT GCT ATT 
Lys lie Phe Ala lie 
455 



GCT ACC ATC AAC AAG 13 92 

Ala Thr lie Asn Lys 

460 



TCT GGC TGG GGC ACT TCT GGC AGG AAA ACC ATT GTC GTG AAC TTC ATT 1440 
Ser Gly Trp Gly Thr Ser Gly Arg Lys Thr lie Val Val Asn Phe lie 
465 470 475 480 
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GGC CTA ATC 
Gly Leu lie 

GCC TAC AC A 
Ala Tyr Thr 

TTC CTA GTC 
Phe Leu Val 
515 

CTC ATG CTG 
Leu Met Leu 
530 

GAA CAG TAT 
Glu Gin Tyr 
545 



CCC GTG TCC 
Pro Val Ser 
485 

GCT TAT TGC 
Ala Tyr Cys 
500 

TCT GGG GCC 
Ser Gly Ala 

TAT CTG GCC 
Tyr Leu Ala 

AGC CTG GCT 
Ser Leu Ala 
550 



ATC TGG GTG 
lie Trp Val 

CAG GAC CTG 
Gin Asp Leu 
505 

ATC CTG TAT 
lie Leu Tyr 
520 

ATT ATT GCC 
He lie Ala 
535 

TTT GCG GAG 
Phe Ala Glu 
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GCA GTT CTT 
Ala Val Leu 
490 

TTC AGT GAG 
Phe Ser Glu 



GGC TGC TAC 
Gly Cys Tyr 

CGG AGG TGT 
Arg Arg Cys 
540 

GTG TGA 
Val 



CTA GGG GGG 
Leu Gly Gly 
495 

ACC GAG CTA 
Thr Glu Leu 
510 

TGG GTG GCC 
Trp Val Ala 
525 

GGG AAG AAG 

Gly Lys Lys 



TTA 14 68 
Leu 



GCC 153 6 
Ala 



CTC 1584 
Leu 



CCA 16 32 
Pro 



1665 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Pro Val Gin Leu Thr Thr Ala Leu Arg Val Val Gly Thr Ser Leu 

15 10 15 

Phe Ala Leu Val Val Leu Gly Gly lie Leu Ala Ala Tyr Val Thr Gly 

20 25 30 

Tyr Gin Phe lie His Thr Glu Lys His Tyr Leu Ser Phe Gly Leu Tyr 

35 40 45 

Gly Ala lie Leu Gly Leu His Leu Leu lie Gin Ser Leu Phe Ala Phe 

50 55 60 

Leu Glu His Arg Arg Met Arg Arg Ala Gly Arg Pro Leu Lys Leu His 
65 70 75 80 

Cys Ser Gin Arg Ser Arg Ser Val Ala Leu Cys He Ala Ala Tyr Gin 

85 90 95 

Glu Asp Pro Glu Tyr Leu Arg Lys Cys Leu Arg Ser Ala Gin Arg He 

100 105 110 

Ala Phe Pro Asn Leu Lys Val Val Met Val Val Asp Gly Asn Arg Gin 

115 120 125 

Glu Asp Thr Tyr Met Leu Asp He Phe His Glu Val Leu Gly Gly Thr 

130 135 140 

Glu Gin Ala Gly Phe Phe Val Trp Arg Ser Asn Phe His Glu Ala Gly 
145 150 155 160 

Glu Gly Glu Thr Glu Ala Ser Leu Gin Glu Gly Met Glu Arg Val Arg 
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165 170 175 

Ala Val Val Trp Ala Ser Thr Phe Ser Cys lie Met Gin Lys Trp Gly 

180 185 190 

Gly Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asn Ser 

195 200 205 

Val Asp Tyr lie Gin Val Cys Asp Ser Asp Thr Val Leu Asp Pro Ala 

210 215 220 

Cys Thr lie Glu Met Leu Arg Val Leu Glu Glu Asp Pro Gin Val Gly 
225 230 235 240 

Gly Val Gly Gly Asp Val Gin lie Leu Asn Lys Tyr Asp Ser Trp lie 

245 250 255 

Ser Phe Leu Ser Ser Val Arg Tyr Trp Met Ala Phe Asn Val Glu Arg 

260 265 270 

Ala Cys Gin Ser Tyr Phe Gly Cys Val Gin Cys lie Ser Gly Pro Leu 

275 280 285 

Gly Met Tyr Arg Asn Ser Leu Leu Gin Gin Phe Leu Glu Asp Trp Tyr 

290 295 300 

His Gin Lys Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His 
305 310 315 320 

Leu Thr Asn Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala 

325 330 335 

Arg Ser Lys Cys Leu Thr Glu Thr Pro Thr Arg Tyr Leu Arg Trp Leu 

340 345 350 

Asn Gin Gin Thr Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr 

355 360 365 

Asn Ser Leu Trp Phe His Lys His His Leu Trp Met Thr Tyr Glu Ser 

370 375 380 

Val Val Thr Gly Phe Phe Pro Phe Phe Leu lie Ala Thr Val lie Gin 
385 390 395 400 

Leu Phe Tyr Arg Gly Arg lie Trp Asn lie Leu Leu Phe Leu Leu Thr 

405 410 415 

Val Gin Leu Val Gly lie lie Lys Ala Thr Tyr Ala Cys Phe Leu Arg 

420 425 430 

Gly Asn Ala Glu Met lie Phe Met Ser Leu Tyr Ser Leu Leu Tyr Met 
435 440 445 

Ser Ser Leu Leu Pro Ala Lys lie Phe Ala lie Ala Thr lie Asn Lys 

450 455 460 

Ser Gly Trp Gly Thr Ser Gly Arg Lys Thr lie Val Val Asn Phe lie 

465 470 475 480 

Gly Leu lie Pro Val Ser lie Trp Val Ala Val Leu Leu Gly Gly Leu 

485 490 495 

Ala Tyr Thr Ala Tyr Cys Gin Asp Leu Phe Ser Glu Thr Glu Leu Ala 

500 505 510 

Phe Leu Val Ser Gly Ala lie Leu Tyr Gly Cys Tyr Trp Val Ala Leu 

515 520 525 

Leu Met Leu Tyr Leu Ala lie lie Ala Arg Arg Cys Gly Lys Lys Pro 

530 535 540 

Glu Gin Tyr Ser Leu Ala Phe Ala Glu Val 
545 550 

<2) INFORMATION FOR SEQ ID NO: 33: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Arg Trp Leu Aan Gin Gin Thr Arg Trp 
1 5 

(2) INFORMATION FOR SEQ ID N0:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TACTGGATGG CTTTCAACGT GGAG 24 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asn Ser Val 

1 5 10 15 

Asp Tyr lie Gin Val Cys Asp Ser Asp Thr Val Leu Asp Pro Ala Cys 

20 25 30 

Thr lie Glu Met Leu Arg Val Leu Glu Glu Asp 

35 40 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
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Lys Cys Ser Phe Gly Asp Asp Arg His Leu Thr Asn Arg Val Leu Ser 

15 10 15 

Leu Gly Tyr Arg Thr Lys Tyr Thr Ala Arg Ser Lys Cys Leu Thr Glu 

20 25 30 

Thr Pro Thr Arg Tyr Leu Arg Trp Leu Asn Gin Gin Thr Arg Trp Ser 

35 40 45 

Lys Ser Tyr Phe Arg Glu Trp 
50 55 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
GTCATCCAGA GGTGGTGCTT ATGG 24 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CCGAATTCAA GATGGCGGTG CAGCTGACTA CAGCC 3 5 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CCGAATTCTC ACACCTCCGC AAAAG CCAGG C 31 

(2) INFORMATION FOR SEQ ID NO : 4 0 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 55 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

His Cys Thr Phe Gly Asp Asp Arg His Leu Thr Asn Arg Met Leu Ser 

15 10 15 

Met Gly Tyr Ala Thr Lys Tyr Thr Ser Arg Ser Arg Cys Tyr Ser Glu 

20 25 30 

Thr Pro Ser Ser Phe Leu Arg Trp Leu Ser Gin Gin Thr Arg Trp Ser 

35 40 45 

Lys Ser Tyr Phe Arg Glu Trp 
50 55 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
TGTGCAGTGT AATTAGTGGG CCCT 24 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Tyr Cys Thr Leu Gly Asp Asp Arg His Leu Thr Asn Arg Val Leu Ser 

15 10 15 

Met Gly Tyr Arg Thr Lys Tyr Thr His Lys Ser Arg Ala Phe Ser Glu 

20 25 30 

Thr Pro Ser Leu Tyr Leu Arg Trp Leu Asn Gin Gin Thr Arg Trp Thr 

35 40 45 

Lys Ser Tyr Phe Arg Glu Trp 
50 55 

(2) INFORMATION FOR SEQ ID NO:43: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 23 base pairs 
{ B ) TYPE: nucleic acid 

( C ) STRANDEDNES S : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

GTTGAGCCAC CGGAGGTACT TAG 2 3 

(2) INFORMATION FOR SEQ ID NO: 44: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 amino acids 
(8) TYPE: amino acid 
<C) STRANDEDNES S : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Pro Val Ser lie Gly Asp Asp Arg Cys Leu Thr Asn Tyr Ala lie Asp 

15 10 15 

Leu Gly Arg Thr Val Tyr Gin Ser Thr Ala Arg Cys Asp Thr Asp Val 

20 25 30 

Pro Phe Gin Leu Lys Ser Tyr Leu Lys Gin Gin Asn Arg Trp Asn Lys 

35 40 45 

Ser Phe Phe Arg Glu Ser 
50 

(2) INFORMATION FOR SEQ ID NO: 45: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Asn Met Tyr Leu Ala Glu Asp Arg lie Leu Cys Trp Glu Leu Val Ala 

15 10 15 

Lys Arg Asp Ala Lys Trp Val Leu Lys Tyr Val Lys Glu Ala Thr Gly 

20 25 30 

Glu Thr Asp Val Pro Glu Asp Val Ser Glu Phe lie Ser Gin Arg Arg 

35 40 45 

Arg Trp Leu Asn Cys Ala Met Phe Ala Ala 
50 55 

(2) INFORMATION FOR SEQ ID NO: 46: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Pro Ser Asp Phe Gly Glu Asp Arg His Leu Thr lie Leu Met Leu Lys 

15 10 15 

Ala Gly Phe Arg Thr Glu Tyr Val Pro Asp Ala lie Val Ala Thr Val 

20 25 30 

Val Pro Asp Thr Leu Lys Pro Tyr Leu Arg Gin Gin Leu Arg Trp Ala 

35 40 45 

Arg Ser Thr Phe Arg Asp Thr 
50 55 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

Lys Ala Gly Ala Glu Asn Ala Leu Val Arg Val Ser Ala Val Leu Thr 

15 10 15 

Asn Ala Pro Phe lie Leu Asn Leu Asp Cys Asp His Tyr Val Asn Asn 

20 25 30 

Ser Lys Ala Val Arg Glu Ala Met Cys Phe Leu Met Asp 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Tyr Gly Ser Val Thr Glu Asp lie Leu Thr Gly Phe Lys Met His Cys 

15 10 15 

Arg Gly Trp Arg Ser lie Tyr Cys Met Pro Leu Arg Pro Ala Phe Lys 

20 25 30 

Gly Ser Ala Pro He Asn Leu Ser Asp Arg Leu His Gin Val Leu Arg 
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35 40 
Trp Ala Leu Gly Ser Val Glu lie Phe 
50 55 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 
GTGCTTCTGT CTCTCTACGC G 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 
CCAGTCCCAA TATAGTCCAG ACTG 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 51 
GGTGTGTTCA GTGCATTAGT GGA 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
TAGCCATCTG AGATATTCTA TAGGT 25 
(2) INFORMATION FOR SEQ ID NO: 53: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 
GTCAGAGCTA CTTCCACTGT G 21 
(2) INFORMATION FOR SEQ ID NO: 54: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: CDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
AAGGAGGAGG GCGTCTCCGA G 21 
(2) INFORMATION FOR SEQ ID NO: 55: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

GAATTCCGGG CGCCCGGGAC TCACGCCCCT TCCTTTCCCC TCTCGCTCCC AGCAGGACGC 60 

GCCCAAGCCC ACTCCTGCAG CCCGCCGCTG CTCCGGCCTG GCCCGGAGGG TGCTGACCAT 12 0 

CGCCTTCGCC CTGCTCATCC TGGCCCTCAT GACCTGGGCC TACGCCGCCG GGGTGCCGCT 180 

GGCCTCCGAT CGCTACGGCC TCCTGGCCTT CGGCCTCTAC GGGGCCTTCC TTTCAGCGCA 240 

CCTGGTGGCG CAGAGCCTCT TCGCGTACCT GGAGCACCGG CGGGTGGCGG CGGCGGCGCG 3 00 

GGGGCCGCTG GATGCAGCCA CCGCGCGCAG TGTGGCGCTG ACCATCTCCG CCTACCAGGA 360 

GGACCCCGCG TACCTGCGCC AGTGCCTGGC GTCCGCCCGC GCCCTGCTGT ACCCGCGCGC 420 

GCGCGTGCGC GTCCTCATGG TGGTGGATGG CAACCGCGCC GAGGACCTCT ACATGGTCGA 4 80 

CATGTTCCGC GAGGTCTTCG CTGACGAGGA CCCCGCCACG TACGTGTGGG ACGGCAACTA 540 

CCACCAGCCC TGGGAACCCG CGGCGGCGGG CGCGGTGGGC GCCGGAGCCT ATCGGGAGGT 6 00 

GGAGGCGGAG GATCCTGGGC GGCTGGCAGT GGAGGCGCTG GTGAGGACTC GCAGGTGCGT 660 
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GTGCGTGGCG CAGCGCTGGG 
CGGAGATTCG GTGGACTACG 



GCGGCAAGCG CGAGGTCATG TACACAGCCT TCAAGGCGCT 
TGCAGGTCTG TGACTCGGAC ACAAGGTTGG ACCCCATGGC 



720 
780 



ACTGCTGGAG CTCGTGCGGG TACTGGACGA GGACCCCCGG GTAGGGGCTG TTGGTGGGGA 84 0 



CGGTCCTCTA GGCCTATATA GGAATAACCT CTTGCAGCAG TTTCTTGAGG CCTGGTACAA 102 0 

CCAGAAGTTC ~ CTGGGTACCC ACTGTACTTT TGGGGATGAC CGGCACCTCA CCAACCGCAT 108 0 

GCTCAGCATG GGTTATGCTA CCAAGTACAC CTCCAGGTCC CGCTGCTACT CAGAGACGCC 114 0 

CTCGTCCTTC CTGCGGTGGC TGAGCCAGCA GACACGCTGG TCCAAGTCGT ACTTCCGTGA 12 00 

GTGGCTGTAC AACGCGCTCT GGTGGCACCG GCACCATGCG TGGATGACCT ACGAGGCGGT 126 0 

GGTCTCCGGC CTGTTCCCCT TCTTCGTGGC GGCCACTGTG CTGCGTCTGT TCTACGCGGG 13 2 0 

CCGCCCTTGG GCGCTGCTGT GGGTGCTGCT GTGCGTGCAG GGCGTGGCAC TGGCCAAGGC 13 8 0 

GGCCTTCGCG GCCTGGCTGC GGGGCTGCCT GCGCATGGTG CTTCTGTCGC TCTACGCGCC 144 0 

CCTCTACATG TGTGGCCTCC TGCCTGCCAA GTTCCTGGCG CTAGTCACCA TGAACCAGAG 1500 

TGGCTGGGGC ACCTCGGGCC GGCGGAAGCT GGCCGCTAAC TACGTCCCTC TGCTGCCCCT 156 0 

GGCGCTCTGG GCGCTGCTGC TGCTTGGGGG CCTGGTCCGC AGCGTAGCAC ACGAGGCCAG 16 20 

GGCCGACTGG AG CGGCCCTT CCCGCGCAGC CGAGGCCTAC CACTTGGCCG CGGGGGCCGG 16 8 0 

CGCCTACGTG GGCTACTGGG TGGCCATGTT GACGCTGTAC TGGGTGGGCG TGCGGAGGCT 174 0 

TTGCCGGCGG CGGACCGGGG GCTACCGCGT CCAGGTGTGA GTCCAGCCAC GCGGATGCCG 1800 

CCTCAAGGGT CTTCAGGGGA GGCCAGAGGA GAGCTGCTGG GCCCCGAGCC ACGAACTTGC 186 0 

TGGGTGGTTC TCTGGGCCTC AGTTTCCCTC CTCTGCCAAA CGAGGGGGTC AGCCCAAGAT 192 0 

TCTTCAGTCT GGACTATATT GGGACTGGGA CTTCTGGGTC TCCAGGGAGG GTATTTATTG 198 0 

GTCAGGATGT GGGATTTGAG GAGTGGAGGG GAAGGGGTCC TGCTTTCTCC TCGTTCTTAT 204 0 

TTAATCTCCA TTTCTACTGT GTGATCAGGA TGTAATAAAG AATTTTATTT ATTTTCAAAA 2100 

AAAAAAAA 2108 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 56 : 

Asn Met Tyr Leu Ala Glu Asp Arg lie Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Asn Gin Cys Ser Phe Gly Asp Asp Arg His 
15 10 



TGTGCGGATC CTTAACCCTC 
GGTAGCCTTC AATGTGGAGC 



TGGACTCCTG GGTCAGCTTC CTAAGCAGCC TGCGATACTG 
GGGCTTGTCA GAGCTACTTC CACTGTGTAT CCTGCATCAG 



900 



960 
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WHAT IS CLAIMED IS: 

1 . An isolated and purified DNA molecule comprising a preselected DN A 
segment encoding hyaluronan synthase-2, a biologically active variant 
thereof or a biologically active subunit of the variant. 

2. The DNA molecule of claim 1 wherein the preselected DNA segment 
encodes murine hyaluronan synthase-2. 

3. The DNA molecule of claim 1 or 2 wherein the preselected DNA 
segment encodes a hyaluronan synthase-2 having SEQ ID NO:2. 

4. The DNA molecule of claim 1 wherein the preselected DNA segment 
comprises SEQ ID NO:l . 

5. The DNA molecule of claim 1 wherein the preselected DNA segment 
encodes human hyaluronan synthase-2. 

6. The DNA molecule of claim 1 or 5 wherein the preselected DNA 
segment comprises SEQ ID NO:23. 

7. An isolated and purified DNA molecule comprising SEQ ID NO:l . 

8. An isolated and purified DNA molecule comprising a preselected DNA 
segment encoding hyaluronan synthase-3, or a biologically active subunit 
or variant thereof. 

9. The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes murine hyaluronan synthase-3. 
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10. The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes a hyaluronan synthase-3 having SEQ ID NO:32. 

1 1 . The DNA molecule of claim 8 wherein the preselected DNA segment 
comprises SEQ ID NO: 3 1 . 

12. The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes human hyaluronan synthase-3. 

1 3 . The DNA molecule of claim 8 wherein the preselected DNA segment 
comprises SEQ ID NO:25. 

14. The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes a polypeptide comprising SEQ ID NO:29. 

1 5. A primer or a probe, having at least about 15 nucleotides, wherein the 
primer or probe has at least about 80% identity to the DNA molecule of 
claim 8. 

16. An expression cassette comprising a promoter operably linked to a 
preselected DNA segment encoding hyaluronan synthase-2. 

1 7. An expression cassette comprising a promoter operably linked to a 
preselected DNA segment encoding hyaluronan synthase-3. 

1 8. A host cell, the genome of which is augmented by a preselected DNA 
segment encoding hyaluronan synthase-2. 

19. A host cell, the genome of which is augmented by a preselected DNA 
segment encoding hyaluronan synthase-3. 
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20. A method to produce hyaluronan synthase-2, comprising: culturing a host 
cell transformed with a nucleic acid molecule comprising a DN A 
segment encoding hyaluronan synthase-2 operably linked to a promoter, 
so that said host cell expresses said hyaluronan synthase-2. 



2 1 . The method of claim 20 further comprising isolating hyaluronan 
synthase-2 from the host cell. 

22. A method to produce hyaluronan synthase-3, comprising: culturing a host 
cell transformed with a nucleic acid molecule comprising a DNA 
segment encoding hyaluronan synthase-3 operably linked to a promoter, 
so that said host cell expresses said hyaluronan synthase-3. 

23. The method of claim 22 further comprising isolating hyaluronan 
synthase-3 from the host cell. 

24. A method of altering the amount of hyaluronan produced by a cell, 
comprising: 

(a) introducing into a host cell a preselected DNA segment encoding . 
hyaluronan synthase-2 operably linked to a promoter functional in 
the host cell so as to yield a transformed host cell; and 

(b) expressing the preselected DNA segment in the transformed host 
cell in an amount that alters the amount of hyaluronan produced 
by the transformed cell relative to the amount of hyaluronan 
produced by a corresponding untransformed cell. 



25. A method of altering the amount of hyaluronan produced by a cell, 
comprising: 

(a) introducing into a host cell a preselected DNA segment encoding 
hyaluronan synthase-3 operably linked to a promoter functional in 
the host cell so as to yield a transformed host cell; and 
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(b) expressing the preselected DNA segment in the transformed host 
cell in an amount that alters the amount of hyaluronan produced 
by the transformed cell relative to the amount of hyaluronan 
produced by a corresponding untransformed cell. 

The method of claim 24 or 25 wherein the amount of hyaluronan 
produced by the transformed host cell is increased relative to the amount 
of hyaluronan produced by the corresponding untransformed host cell. 

The method of claim 24 or 25 wherein the amount of hyaluronan 
produced by the transformed host cell is decreased relative to the amount 
of hyaluronan produced by the corresponding untransformed host cell. 

Isolated, purified hyaluronan synthase-2 polypeptide, or a biologically 
active subunit or variant thereof. 

The hyaluronan synthase-2 polypeptide of claim 28 having SEQ ID 
NO:2. 

Isolated, purified hyaluronan synthase-3 polypeptide, or a 
biologically active subunit or variant thereof. 

The hyaluronan synthase-3 polypeptide of claim 3 1 having SEQ ID 
NO:32. 

A method to prevent or treat a condition associated with an alteration in 
hyaluronan synthesis or extracellular accumulation, comprising: 
administering to a mammal afflicted with, or at risk of, said condition an 
amount of mammalian hyaluronan synthase-2 effective to alter 
hyaluronan synthesis or extracellular accumulation. 



A method to prevent or treat a condition associated with an alteration in 
hyaluronan synthesis or extracellular accumulation, comprising: 
administering to a mammal afflicted with, or at risk of, said condition an 
amount of mammalian hyaluronan synthase-3 effective to alter 
hyaluronan synthesis or extracellular accumulation. 

A method to identify a mammal afflicted with, or at risk of, a condition 
associated with aberrant hyaluronan synthesis or extracellular 
accumulation, comprising: 

(a) contacting an agent that binds to mammalian hyaluronan 
synthase-2 with a mammalian sample suspected of containing 
hyaluronan synthase-2 so as to form a complex; and 

(b) detecting or determining the presence or amount of complex 
formation and correlating the presence or amount of complex 
formation with the presence or absence of the condition. 

A method to identify a mammal afflicted with, or at risk of, a condition 
associated with aberrant hyaluronan synthesis or extracellular 
accumulation, comprising: 

(a) contacting an agent that binds to mammalian hyaluronan 
synthase-3 with a mammalian sample suspected of containing 
hyaluronan synthase-3 so as to form a complex; and 

(b) detecting or determining the presence or amount of complex 
formation and correlating the presence or amount of complex 
formation with the presence or absence of the condition. 

A method for detecting hyaluronan synthase-2 DNA, comprising: 
(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-2 
RNA, with an amount of at least two oligonucleotides under 
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conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-2 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-2-specific oligonucleotide; and 
(b) detecting the presence or amount of the amplified hyaluronan 
synthase-2 DNA. 

37. A method for detecting hyaluronan synthase-3 DNA, comprising: 

(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-3 
RNA, with an amount of at least two oligonucleotides under 
conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-3 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-3 -specific oligonucleotide; and 

(b) detecting the presence or amount of the amplified hyaluronan 
synthase-3 DNA. 

38. A method for detecting a condition associated with aberrant hyaluronan 
synthesis or extracellular accumulation, comprising: 

(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-2 
RNA, with an amount of at least two oligonucleotides under 
conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-2 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-2-specific oligonucleotide; and 

(b) detecting the presence or amount of the amplified hyaluronan 
synthase-2 DNA, wherein the presence or amount of hyaluronan 
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synthase-2 DN A is indicative of the presence of the condition in 
said mammal. 

39. A method for detecting a condition associated with aberrant hyaluronan 
synthesis or extracellular accumulation, comprising: 

(a) contacting an amount of DN A obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-3 
RNA, with an amount of at least two oligonucleotides under 
conditions effective to amplify the DN A by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-3 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-3 -specific oligonucleotide; and 

(b) detecting the presence or amount of the amplified hyaluronan 
synthase-3 DNA, wherein the presence or amount of hyaluronan 
synthase-3 DNA is indicative of the presence of the condition in 
said mammal. 

40. The method of claim 36, 37, 38 or 39 wherein the physiological sample is 
a tissue sample. 

41 . The method of claim 36, 37, 38 or 39 wherein the physiological sample is 
a fluid. 

42. A therapeutic method, comprising: administering to a mammal an 
amount of an agent effective to alter native hyaluronan synthase-2 
activity in said mammal. 

43. A therapeutic method, comprising: administering to a mammal an 
amount of an agent effective to alter native hyaluronan synthase-3 
activity in said mammal. 
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44. A method to prepare hyaluronan, comprising: contacting an amount of 
isolated hyaluronan synthase-2 with a mixture of components under 
conditions effective to yield hyaluronan. 



45. The method of claim 44 wherein the hyaluronan synthase-2 is obtained 
by the method of claim 20. 



46. A method to prepare hyaluronan, comprising: contacting an amount of 
isolated hyaluronan synthase-3 with a mixture of components under 
conditions effective to yield hyaluronan. 



47. The method of claim 44 wherein the hyaluronan synthase-3 is obtained 
by the method of claim 22. 
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215 
239 



1 acatgtaagaagaaggagaagtcaaggcgtctggaaagaattacccagtcctggcttcgagcagcccattga 

73 acgggggactcgaaccagccaaagacttcttcattctgctcttgctagactctgctgagcctcgacccggct 

145 tgcaggttgatgtgaaaagagattttgtgtcgtcggagggaaggggattggagcaaatagcaaaacaggggg 

217 aaaagttaatttatctttaaagcagatataacaaagaattagaagacttaagtgcagcggaaatataaagag 

289 aatattagtgaaatttcttctcaaagaggggagaaccaagcatttaaggctcccccatctttttttttaaat 

3 61 gttgtttttaaatttcctattttttttggccggtcgtctcaaattcatctgatttcctattacctcaatttt 

433 ggaaacttccttccacgaccccccgggaccacacagacaggcggaggacgagtctatgagcaggagctgaac 

505 aagATGCATTGTGAGAGGTTTCTATGTGTCCTGAGAATAATTGG^ 

MHCERFLCVLRIIGTTLFGVS JL L 23 

577 CTCGGAATCACAGCTGCTTATATTGTTGGCTACCAGTTTATCCAAACA^ 

LGITAAYIVGYQFIQTDNYYFSFG 47 

649 CTGTACGGTGCCTTTTTAGCCTCGC ATCTC ATC ATCCAAAGCCTCTTTGCCTTTTTGGAACACCGGAAAATC 

LYGAFLASHLIIQSLFAFLEHRKM 71 

721 AAGAAGTCCCTTGAAACCCCGATTAAATTGAACAAAACGCT^ 

kkslet piklnktvalc iaayqed 

793 C CTGACTACTTACGGAAATGTTTGCAATCTGTGAAAAGGCTC^ 

PDYLRKCLQSVKRLTYPGIKVVMV 

865 ATCGATGGGAACTCAGACGACGACCTTTACATGATGGACATATTCAGCGA^ 

IDGNSDDDLYMMDIFSEVIGRDKS143 

937 GCCACGTACATCTGGAAGAACAACTTTCATGAAAAGGGACCTGGT 

ATYIWKNNFHEKGPGETEESHKES167 
1009 TCACAACATGTCACCCAATTGGTCTTGTCTAACAAAAGTATTTGCATCATGCAAAAATGGG^ 

SQHVTQLVLSNKSICIMQ K W G G K R 191 
1081 GAAGTCATGTACACAGCCTTCAGAGCACTGGGGCGAAGCGTGGATTATGTACAGGTGTGTC 

EVMYTAFRALGRSVDYVQVCDSDT 

1153 ATGCTTGACCCTGCCTCATCTGTGGAGATGGTGAAGGTCm 

MLDPASSVEMVKVLEEDPMVGGVG 

1225 GGAG ATGTC C AG ATTTTAAACAAGTATG ATTCCTGGATCTC CTTCCTC AG C AGC GTG AG AT ACTGGATGGCT 

GDVQI LNKYDSWISFLSSVRYWMA263 
1297 TTTAATATAGAAAGGGCCTGCCAGTCTTATTTTGGCTGTGTCCAGTGCATAAGCGGTC 

FNI ERACQSYFGCVQC 1 SGPLG M Y 287 
1369 AGAAACTCCTTGCTGCATGAATTTGTGGAAGACTGGTACAATC 

RNSLLHEFVEDWYNQEFMGNQCSF311 
1441 GGTGACGACAGGCACCTTACC AAC AGGGTGTTGAGTCTGGGCTATGCAACTAAATACACGGCTCGGTCCAAG 

GDDRHLTNRVLSLGYATKYTARSK 
1513 TGCCTTACTGAAACTCCCATAGAATATCTGAGATGGCTGAACCAGCAGACCCGATGGAGCAAGTCCTACTTC 

CLTETP I EYLRWLNQQTRWSKSYF 
1585 CGAGAGTGG CTGTAC AATGCC ATGTGGTTTCACAAGCATCACCTGTGG ATG ACC TATGAAGCTGT^ 

REWLYNAMWFHKHHLWMTYEAVIT383 

1657 GG ATTCTTTC CTTTC TTTCTC ATTGC C AC AG TC ATC C AGC TCTTCTAC AGGGG TAAAATC TGGAAC ATC CTC 

GFFPFFLIATVIQLFYRGKIWNIL407 

1729 CTCTTC C TGTT AAC TG TCC AG CTAGTGGGTCTC ATC AAGTC ATCTTTTG C C AGCTGC C TT AG AGG AAAT ATC 

LFLLTVQLVGLIKSSFASCLRGNI431 

1801 GTCATGGTATTCATGTCTCTGTATTCAGTGTTATACATGTCAAGTCTACTTCCTC 

VMVFMS LYSVLYM SSLLP A K M F A I 455 
1873 GCAACCATAAACAAAGCTGGGTGGGGCACATCTGGAAGGAAGACCATTGTTC 

ATINKAGWGTSGRKTIVVNFIGLI 479 

1945 CCAGTGTCCGTGTGGTTTACAATCCTTCTAGGTGGTGTAATTTTCACCAl^ 

PVSVWFTILLGGVIFTIYKESK K . P 503 

2 017 TTTTC C GAATC C AAAC AG ACTGTTCTC ATCGTGGGAACTTTG ATC T ATGC ATGC TACTGGGTC ATGC TTTTG 

FS ESKQTVL IVGTLIYACYWVMLL 527 

2089 ACTCTOT ATGTGGTTCTC ATC AATAAGTG TGGC AGGC GG AAG AAGGG A C AAC AGTATG AC ATGG TG CTTG AT 

TLYVVL INKCGRRKKGQQYDMVLD 551 

2161 GTA TOA t-ga^^-fcfcqtaqtcacacctaaaoJigiiffaeaeaeacaeacatcarnrncacacaCflCcttagctc 

V 552 
2 232 ctcaaggggctatacagtattgtggcaccgcaccctgccaccacaggagacatatcactgctgctgggactt 

2304 gaacaaagacattcaatgggggttggtttcttttttattctgccaaagcaaattgatacatcagtgagaaga 

2376 aagtccgattaaatctgacagttttaggacggtgggatgatgtcttggcttatgcacttttcccttactgtg 

2448 catctgcctgacagtgtctgttctaaatacctcacttgccatgctttgtgtgggtgatcatggaagaaaagg 

2520 attctgaaaactcaagggaacgttctttcaacctacacatcctaacttatggactcttttgatagctgatga 

2 592 ttttctttctattttttgtttttaaggaaaattgttcatctttaccaaatgaaatgccaaaggaaagttgga 

2664 aagccactggctatgctgtattttgatataataattgtactgtgttttaaattttgtatccggatttttaaa 

2 73 6 aacaaaatttcacaccatagtctatattttacttctctggcaaaatacacttttgttcttt f nr . fltatatat 

2808 Atratatat;a r. f 1 ^^ a ^ Aaa ataqqttctaaaaaaatccatactataaaaaaaaa 

2 880 aaacgtggttgactgatgttcatgaaagaa y tAaa a tgtttctctctttctctacattttaaaaaaaaa 

FIG. 3 
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MHa«2 -MHCERFLCVLR- 1 IGTTL FGVSLLLG I TAVY I VOYQriQTO KTYFSFOLYOAFL 

KHasl MRQOMPKPS EAARCCSGLARRALT 1 1 FALL I LGI>fTKAYAAGVPLAS DRYGLLAFOLYGAVL 

DG42 MKE KAAETME I PE G I PKDLE PKH PTLWRI I YYS FGWUAT ITAAYVAE FQVLKHEAI LFSLGLYGLAM 

HasA MPIFKKTLXVLSFIFLISILIYLNMYLFGTST VGI-YGVILITYLVIKL QLSF 

HodC MTLLDTTSTAAIS I - YALIJLiTAYRSMQVLY ARP I DGLAV 



HH&G2 

KHasl 

OG42 

HasA 

NodC 

MHa«2 
MHasl 
DG42 
HasA 

HodC 

MHafe2 

MHasl 

DG42 

HasA 

HodC 

MHas2 

MHasl 

DG42 

HasA 

HodC 

MHas2 

MHasl 

OG42 

HasA 

HodC 



ASHLI IQSLFAFLE HRKM KKSLETPIKLNKT VALC IAAYQEDPDYLRKCLQSVKRLTYPQIK 

SAHLVAQSI^AYLEHIUIVAAAARRSIAKGPM 

LLH1MMQSLFAFI-EIRRV NKSEL-PCSFKKT VAIiTIAGYQENPEYLIKCLEfiCKYVKYPKDK 

LYE PFKGNPHDY KVAAVI PSYHEOAESU^TLKSVLAQTYPLS - 

AAEPVETRPLP AVDVXVPSFNEDPGILSACLA8IADQDYPGE- 



** 



- -VVMV^DGNSOOOLYhft^IFSEVIGRDKSATYIHKNNFHE-KGPGET EESHKESSQHVTQ- 

U*VUCATX3NRAEDLYMVIX4FR£^^ 

LKX ILVIDGNTEDDAYMMEMFKDVFHCTDVG^^ 

-EIYIVDDGSSMTDAIQL IEEYVNRE VDICRNVIVHRS — 

LKVYVVDDGSRNREAIVR VRAFYSRD PRFSFXLLPE 

* ** * * * * * 

LVLSNKS I CIMQKHGGKREVMYTAFRALORSVDYVQVCT 

VEALVRTRRCNflCVAQRHGGKREVMYTAFKAI^^ 
VEELVFMKRCVCIMa^a-KREVMrtAFQM 

LVMK G-KRHAQAHAFERSDADY- FLTV-DSDTY IYPKALEE LLKSFMDETVYAATG 

HV -G-KRKAQIAAIGQSSGDL- VLNV-DSDSTIAET>WSKIASKMW)PEV<»VMG 



* * * 



* * 



** 



GDVQIttlKTOSWISFLSSVRYHMAFNIERACQSYP^ 

GDVRIUTCLDSWSFI^SI^afWAFNVERA^ 

G0VRXI2lPYDSFX6EMSSUOfKMftFHVERAGQS^ 

- HLNARMPQTHLLTRLTD IRYDMAFGVERAAQSLTGN I LVCSGPLS IYRREVI I PNLERYKNQTFLGLP 
^LTASNSGDTT^TKLIDMEYWIAjCNEEIU^AQSFFGA 



** 



** 



CSFGDDRHLTNKVI^I/rcATKtfTARSKCLTETPIEYL^^ 

CTFOTDRHLTNRMLSMGYATKYTSRS RCYSETPS S FLRHLSOQTRWSKSYFREOT^XKAI^TWHRH^ 

CCtXnDWBAKPML^^ 

VSIGDDRCXTNYAIDLG-RTVYQSTARO>TDW^ 

SDFCTORHLTILMLKAGFRTEYVPDAIVATVVPDTI-K^ PI* — L 



MHas2 
MHasl 
DG42 
HasA 

Mode 

MHas2 
MHasl 
DG42 
HasA 

NodC 

MHas2 

MHasl 

OG42 

HasA 

HodC 



H MTYEAVI TGFFPFFLIATVIQLEYBGKI — WNI UXIXTVQLVGUKSSFASCLRGNIV 

H MTYEAW SGLFPFFVAATVLRLFYAGRP — RALI^VIXCVQGVAIAKAAFAAWIJUJCVR 

H MTVESW SFIFPFFITATVIRLIYAGTI — WNVVWUJJCIQI*iSLFKSIYAC^^ 

HTIFEVVMFhfrlZ^AIGMUiFNO^XQI^I^KX^AFLSX IFIVALC R NVHYMVKHPAS 

RGLSPFIJ^FDAVGQMIOQUJJVZ^VVTC^^ MTXIRCSVVALHAROLR 

* * * ** ***** 

MVFMSLYSVLYMS S liLPAKMFAIATIMKAGHGTS GRKTTVVNF- IGLIPV6VHFTILLGGVIFTIYKES 

MVIJLSLYAPLYMCGtXPAKFIALVTl^SGWOT 

MIXMSLY6MZ»rrGtXPSKrFAXXTUIKIGR^ 

FLLSPLYGILHLFVXjQPLKLYSIjCTIKNTEHGTR KKVTIFK* 

FIjGFVI«TPINiri*II#PU<AYALCTLSNSOHWR YSAPEVPVS GGKQTPIQT- - 



*** * 



* * * 



•0MVLOV* 



KKPFSES-KQ TVLIVGTLIYACYHVMLX/TLYVVLXKKOGHRKKOQ^ 

RADWSOTSRAAEAYHLAAGAjGAYVAnTVVML^ 

QNDHSTFEKQKEMYHXJ«YGCVGYVMroVXMAVM -CV* 
SGRVTPDCTCSGE * 
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1 


GTCTTATTTT 


GQGTCTGTTC 


AGTCCATTAG 


TOGACCTCTG 


GGAATCTACA 


51 


GAAACTCCTT 


GTTGCATGAG 


TTTGTGGAAG 


ATTOGTACAA TCAAGAATTT 


101 


ATOGGCAACC 


AATCTAGCTT 


TOGTCATGAC 


AGGCATCTCA 


CGAACCGGGT 


151 


GCTGAGCCTC 


GGCTATCCAA 


CAAAATACAC 


AGCTCGATCT 


AAGTCCCTTA 


201 


CTGAAACACC 


TATAGAATAT 


CTCAGATGGC 


TAAAC 
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HHAS2 1 GTCTTATTTTCGGTGTCTTCAGTC 31 

IIIIIIMIMI lllll MINIM II 
MHos2 1301 atatagaaagggcctgccagtcttattttggctgtgtccagtgcataagc 1350 

• • • « • 

32 GGACX^XriXXX^TCTACAG 81 
li I I I t t I 1 I t I I t I I t I I I I t 1 I I I 1 1 I lllllll 1 t I I t 1 I I I 1 I 
1351 ggtcctctgggaatgtacagaaactccttgctgcatgaatttgtggaaga 1400 

82 TTOGTACAATCAAGAATTTATQGQCAACC^ 131 

1 1 1 1 f 1 1 1 1 1 1 iiim iiiii i j 1 1 1 1 1 1 ii 1 1 r 1 1 1 1 1 1 1 1 1 

1401 ctggtacaatcaggaattcatgggtaaccaatgcagttttggtgacgaca 1450 

• • * • ■ 

132 GGCATCTCAOGAACCQGGTXX^TGAGCXTO 181 
I I I I II II III lllll till I 1 I I I I 1 I I I 1 I I I MfKIM 
1451 ggcaccttaccaacagggtgttgagrtctgggctatgcaactaaatacacg 1500 

• • • • • 

182 GCTCXaTCTAAGTCCXrrTACT^^ 231 
lllll II 11111111111111 Ml II MMMMMI II II Mil 
1501 gctcggtccaagtgccttactgaaactcccatagaatatctgagatggct 1550 

232 AAAC 1 \ \ 235 

III 

1551 gaaccagcagacccgatggagcaagtcctacttccgagagtggctgrtaca 1600 
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HHAS2 1 SYFQCVQCISGPIXMYRNS^ 29 

1 11 1 I i I 1 1 1 1 1 H I 1 I I M I I I II I 1 1 » 
MHQS2 251 WISFLSSVRYWMAFNIERACQS^^ 300 

30 NQEFMSNQCSTOTORHLT^ . 78 

I I 1 1 II I I I I I M I I I I 1 1 I 1 1 1 1 I I M I I 1 1 I 1 1 1 1 I I I I I M I 1 1 1 1 
301 hQEFM3NQCSFX3DDRHI7ITJRVL^ 350 
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MHas3 1 GTCXrnvCTTXGGCTCnC^ 50 

Mlllllf II IIIIIMIII f ) IIMINI! II M ! I I 1 I M I I I I 1 1 
HHAS3 1 GTOCTACTmXX^G^ 50 

• • * • • 

51 GCAAC^GCCTCCTTCAGCAff 100 

111 I I I I II I I I I I II 111 I I I I Mil IN I I M I II III II MINI 
51 GCAACAGCCTCCTKXAGCAG^ 100 

• * « » • 
101 CTAGGCJUjCAAGTGCAGCTTTGGGGA^ 150 

1 1 III I II 1 1 II 111 I M 1 1 Mllllll MINIM 1 1 1 1 1 1 1 1 1 ■ 1 

101 CTAGGCAGCAAGTGCAQCTTOGQGGATC^ 150 

151 CCTGAGTCTTQGCTACOQGACTAAGTAT^ 200 

llllll 1 1 1 1 1 1 1 1 1 f I MIMIMIM II illlt IIIIIMIII 
151 CCTGAGCCTTCXXTAOOGA 200 

• • • 

201 CAGAGACOCCX^TAGGTACCTI^^ 235 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 f (Mill II 1 1 1 1 1 1 1 1 
201 CAGAGAOCCCXZACTAAGTACCTX^CQGT^ 235 

FIG. 11C 



HHAS3 1 SYPaVQCISGFIX3H¥HNSI^^ SO 

Mil llllll III II HUM I MM III I III lllll MM II MM it 
MHas3 1 SYFGCVQCISGPLGMYRNSUX^^ 50 

51 I^LOYIOTCYTORSKCLTETPTKY^ 78 

MMMMII 11111111111:111111 
51 LSIXTnOTCYTARSKCLTETPTRYLRWLN 78 
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1 f ATGCCGGTGCAGCTGACTACAGCXCTGCGTGTGGTGGGCACCAGTCTC 

MPVQLTTALRVVGTSLFALVVL 22 
6 7 GGAGGCATCCrrGGCGGCCTATGTGACAGGCTACCAGTTTATCCACAC 

GGILAAYVTGYQFI HTEKHYLS 44 
133 TTTGGCCTCTACGGTGCCATCCTGGGTCTACATCTGCT 

FGLYGAILGLHLLIQS LFAFLE 66 
199 CACCGTCGAATGCGCAGGGCAGGGCGCCCCCTCAAGCTGCACTGCTCCCAGAGGTCGCGTTCAG^ 

HRRMRRAGR PL KLHCSORSR S V 88 
265 GCACTCTGCATTGCTGCCTACCAAGAGGACCCCGAW 

ALCIAAYQEDPEYL RKCLRSAO 110 
331 CGCATTGCCTTTCCAAACCTCAAGGTGGTCATGGT 

£IAFPNLKVVMVVDGNRQEDTY 132 
397 ATGTTGGACATCTTCCATGAGGTGCTGGGTGGCACTX5 

MLDrFHEVLGGTEQAGFFVWRS 154 
463 AATTTCCATGAGGCGGGTGAAGGAGAGACAGAGGCCAGCCTGCAGGAAGGCATGGAGCGTGTGCGA 

NFHEAGEGETEASLQEGMERVR 176 
529 GCTGTGGTGTGGGCCAGCACCTTCTCATGCATCATGCAGAAGTC 

AVV^WASTFSCIMQK W^G G K R E V M 198 
595 TACACTGCCTTCAAGGCCCTTGGCAACTCAGT 

YTAFKALGNSVDYIQVCDSDTV 220 
661 CTGGACCCAGCCTGCACCATTGAGATGCTrCGAGTCTTGGAAGAAGATCCCCAACT 

L D P A C*T I E M LRVLE E D PQVGGV 242 
727 GGAGGAGATCTCCAJ^TCCTCAACAAGTATGATTCATGGATCTC 

GGDVQILNKYDSWI S FLS SVRY 264 
793 TGGATGGCTTTCAACGTGGAGCGGGCCTGCCAGTCCTACT 

WMAFNVERACQSYFGCVOCISG 286 
859 CCTTTGGGCATGTACCGCAACAGCCTCCTTCAGCAGTTCCTC 

PLGMYRNSLL QQFLEDWYHQKF 308 
925 CTAGGC^GCAAGTGCAGCTTTGGGGATGATCGG CACCTT ACCAACCG AGTC CTGAGTCTTGGCTAC 

LGSKCSFGDDRHLTNRVLSLGY 330 
991 CGGACTAAGTATACAGCACGCTCTAAGTGCCTC^ 

RTKYTARSK CLTET PTRYLRWL 352 
1057 AATCAGCAAACCCGCTGGAGCAAGTCTTACTTTCGGGAATC 

NQQTRWSKSYFREWLYNS LWFH 374 
1123 AAGCACCACCTCTGGATGACCTATGAATCAGT^ 

KHHLWMTYESVV TGF FPFFLIA 396 
1189 ACAGTCATACAACTTTTCTACCGTGGCCGCATCTQGAACATTCT 

TVIQLFYRGRIWNI LLFLLTVQ 418 
1255 CTGGTGGGCATTATCAAGGCTACCTATGC 

L V G I I KATYACFttR GNAEMIFM 440 

1321 TCCCICTACTCCCTTCTCTATATXr^ 

SLYSLLYMSSLLPAKIFAIATI 462 

1387 AACAAGTCTGGCTGGGGCACTTCTGGCAG 

N K5GWGTSGR KtlVVNFIGLlP 484 
1453 GTGTCCATCTGGGTGGCA G TTCTT CTAGGGGGGTTAGCCT ACACAGCTT ATTGCCACG ACCTGTTC 

V S IWVAVLLGGLAYTAYCQDLF 506 
1519 AGTGAGACCGAGCTAGCCTTCCTAGTCTCTGGGGCCATCCTGTATGGCT 

SETELAFLVSGAILYGCYW.VAL 528 

1585 CTCATGCTGTATCTGGCCATTATTGCCCGGAGGTGTGGGAAGAATC 

LMLYLAIIARRCGKKPEQYSLA 550 

1651 TTTGCGGAGGTGTGA 

F A E V 554 
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DG42 
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- -MPVOLTTALR-WGTSL FALWLGG I LAAYVTGYQFI HTEKHY1-SF0LVGAJ: L 

-MHCERFLCVLR- 1 ICTTL FGVSLLUGITAAy IVGYQFIQTDNYYFSPGLYGAFL 

-«-MRQWPOTSEJVARCCSGI*ARRALTI I FAIXII/2*m<AYAAGVPt^DRYG 

MKEKAAETMEIPEGIPKDI^PKHPTLWRI IYYSFGV\^I^TITAAYVA£FQVI*KHEAILFSLCLYGI-AM 
MPIFKKTLIVLSFIFLISILIYLNMYLFGTST VGI-YGVILITYLVIKL GLSF 
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MHas3 
MHas2 
MHasl 
DG42 
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MHas2 

MHasl 

DG42 

HasA 



MHas3 

MHas2 

MHasl 

DG42 

HasA 



MHas3 
MHas2 
MHasl 
DG42 



GI*HLLIQSI*FAFLEHRRM WU^-GRPUO-HCSQRSRSVAI^IAAYQEDPEYXR^^ 

ASHMIOSI-FAFLEHRKM KKSLETPIKLN KTVALC IAAYQEDPDYLRKCI43SVKRLTYPG 

SAnVAMZiFAYLEHRRVAAAARRSLAKGPLOAATA- -RSVALTISAYQEDPAYLRQCX-TSARALLYPH 

IXHLMMMLFAFLE IRRV NKSEL-PCSFK KTVAI-TIAGYQENPEYLIKCIXSCKYVKYPK 

LYEP^^PHDY K - VAAVT PS YHEDAE S I*LE TLKSVIAQTY PL 

* ** * ** ■ * * 

U£ vVMVVIX3NRQEDTYMIX)IFHEVLGGTEQAGFFVW^ EASLQEGMERV 

tit VVMVIDGNSDDDLYMMDIFSEVMG3U)K-SATYIWKNNFHE~KGPGET EESHKESSQHV 

TRIJIVailVVDGHRAEDLYMVTDMFMVFADED- PATYVWDGNYHQPWE PAEATGAVGE GAYRE VEAE D PG 
DIOJCIILVIDCWTEDDATMMEMFKDW 

S—EIYXVDOGSSNTDAIQL IEEYVNRE VDXCRNVIVHR 



* * * 



*** * 



*# 



HA VWASTFSCIMQKWGGKREVMYTAFKALGNSVDYIQVCT 

XD _ Lvi^NKSICIMQKWGCaCREVMYT*^^ 

RJ^VEALVRTRRCVCVAQRSTOGKREVMYTAFKA^ 

I NMVE E L.VRNKRCVC IMQQHGGKREVMYTAFOAI GTSVDYVQVCDSOTKLDE LATVEMVKVL£SNDMYG 

s LVNK G-KRHAOAWAFERSDADV-FLTV-DSDTYIYPNA1-EEIXKSFNDETVYA 

# • * ** * *** * 

WGGDVQILNKYDSWISFLSS^ 
WGGOVOIUIKYDSWISEMSVFYWM^ 
AVGGDVMUiPU>SWSFI^SIJlYWVAF*^ 
AVGGDVRIIltPYOSFTSEMSSUim<M>^^ 

ATG-HLKARMRQTNLX.TRLTD IRYDNAPGVERAAQSI-TGNI LVCSGPU5 X YRREVI I PNtXRYKNQTFL 
* * * ***** # * ***• * 



GS 



GTHCTFGDDRHLTfcWMLSMGYATKYTSRSRCT 
GTYCTU»ORHLTNKVI^<^^ 

GLPVSIGDDRCLTtfTAIDI/^RTVYQSTARCDTO^ * ISVKKILSNP I 

* ** * # ** * * * 

W MTYESW TGFFPFFLIATVTQLFYRGRI - -WNIIXFIXTVQT-VCI IKATYACFLRG 

K MTYEAVI TGFFPFTXIATVTQLTYRflKI - -WNII*LFIXTVQI.VCLIKBS FASCLRG 

w - KIYEAW s(aOTFFVAATVIJtf«A<3lP--WAIJ3f^^ 

- - -H MTYESW SFI FPFFITATVTRIilYAGTI - -WNVVHLLLCIQIWSLFKS I YACWLRG 

VAlilTlFEVVMFMMLIVAIGNlXSTIQAlQt*DL2KLPAFl*SI IFIVALC R-MVHYMVKH 

* ** * * ** ***** ** 

KAEMI FMSLY6LLYMS SZXPAKX FAIATINKSGWGTSGRKT ZWHFXGLX PVS I WVAVLI/3GIAYTAY- 

MIVMVn^YSVT-YMSSIXPAKMEAIATINKACWGTSO 

(^HWI^LYAPLYMCGIXPAKFLALVTt^SGR 

HFlMLIKSI.Y8MLYMrGIXPSKYEAIXTUII^^ IYM 
PASFlXSPl*YGILHLEVljQPIJCLY8IiCTIKNTEWGTRKKVTIFK« 

* * * ******** 



• ** 



-CQDLFSET ELAFLVSGAI LYGCYWVALLMLYLAI 1ARRCO- - KKPEQYS LAF AEV • 

ESKKPFSES-KQ TVL I VGTL I YACYWVMLLTIdfVVI, I - NKCGRRKKGQQYDMV- LDV • 

EARADWSG PS RAAEAY HLAAGAGAYVAYWWMLT I YWVGVRRLC- RRRSGG - YRVQV • 
DCQNDWSTPEKQKEMYHIXYGCVGYVhnrWVIMAVMYWWVKR I PDMCV • 

+ * * ** *** * * * * 
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2 Claims No*.: 
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| | Claims Noa.: 
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Subject 1) Claims 1-7, 16,18,20,21,24,28,29,32,34,36,38,42,44 and 45 and 

partially 26,27,40 and 41 
Subject 2) Claims 8-15,17,19,22,23,25,30,31,33,35,37,39,43,46 and 47 and 

partially 26,27,40 and 41 
FOR FURTHER INFORMATION SEE CONTINUATION SHEET PCT/ISA/210 

1 As all required additional search fees were timely paid by the applicant, this International Search Report covers all 

^— * se arena ble claims. 

2. | I As all searchable claims could be search* d without effort justifying an additional fee, this Authority did not invite payment 
or any additional lee. 



As only some of the required additional search lees were timely paid by the applicant, this International Search Report 
— ' covers only those claims tor which tees were paid, specifically claims No*. : 



| X | No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Noa.: 

Claims of subject 1 
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[ | The additional search fees were accompanied by the applicant's protest. 
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1. Claims: 1-7, 16, 18, 20, 21, 24, 28, 29, 32, 34, 36, 38, 

42, 44 and 45, and partially, 26, 27, 40 and 41 



DNA molecule encoding hyaluronan synthase-2, expression 
cassette and host cell comprising the same, uses thereof for 
producing hyaluronan synthase-2 and for altering the amount 
of hyaluronan produced by a cell; hyaluronan synthase-2 
polypeptide, use thereof for treating a condition associated 
with alteration in hyaluronan synthesis or extracellular 
accumulation; method for identifying a maninal affected by 
aberrant hyaluronan synthesis or extracellular accumulation 
by using a mammalian hyaluronan synthase-2 binding agent or 
an hyaluronan synthase-2-specific oligonucleotide; method 
for detecting hyaluronan synthase-2 DNA; use of an agent 
effective to alter native hyaluronan synthase-2 activity in 
a therapeutic method; method to prepare hyaluronan by using 
hyaluronan synthase-2. 



2. Claims: 8-15, 17, 19, 22, 23, 25, 30, 31, 33, 35, 37, 39, 

43, 46 and 47, and partially 26, 27, 40 and 41 



DNA molecule encoding hyaluronan synthase-3, primer or probe 
thereof, expression cassette and host cell comprising such 
DNA molecule, uses thereof for producing hyaluronan 
synthase-3 and for altering the amount of hyaluronan 
produced by a cell; hyaluronan synthase-3 polypeptide, use 
thereof for treating a condition associated with alteration 
in hyaluronan synthesis or extracellular accumulation; 
method for identifying a maninal affected by aberrant 
hyaluronan synthesis or extracellular accumulation by using 
a maninal ian hyaluronan synthase-3 binding agent or an 
hyaluronan synthase-3-specific oligonucleotide; method for 
detecting hyaluronan synthase-3 DNA; use of an agent 
effective to alter native hyaluronan synthase-3 activity in 
a therapeutic method; method to prepare hyaluronan by using 
hyaluronan synthase-3. 
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